(57) Abstract

The process according to the present invention allows expression and isolation of polypeptides with the proteolytic activity of HCV 
NS3 protease in a pure, catalytically active form, and in amounts that are sufficient for discovery of NS3 protease inhibitors and for 
determination of the three-dimensional structure of the NS3 protease. A further subject of the present invention is a procedure that defines 
the chemical and physical conditions necessary for completion of the proteolytic activity of the above polypeptides. The invention further 
comprises new compositions of matter (expression vectors) containing nucleotide sequences capable of expressing the above mentioned 
polypeptides in culture cells. Finally, new compounds of matter are defined, suitable to measure the above proteolytic activity, and useful 
to develop NS3 protease inhibitors and therefore therapeutic agents for use against HCV. The figure shows the kinetic parameters of HCV 
NS3 protease using the S3 depsipeptide substrate (SEQ ID NO:45).

METHODOLOGY TO PRODUCE, PURIFY AND ASSAY POLYPEPTIDES
WITH THE PROTEOLYTIC ACTIVITY OF THE HCV NS3 PROTEASE

DESCRIPTION

The present invention relates to molecular biology
and to hepatitis C virus (HCV) virology. More
specifically, the invention has as its subject a process
for producing, in a pure form and in high quantities,
polypeptides having the proteolytic activity of HCV NS3
protease, and a method for the effective reproduction in
vitro of the proteolytic activity of these polypeptides
in order to define an enzymatic assay capable of
selecting, for therapeutic purposes, compounds inhibiting
the enzyme activity associated with NS3.

As is known, the hepatitis C virus (HCV) is the main
etiological agent of non-A, non-B hepatitis (NANB). It is
estimated that HCV causes at least 90% of
post-transfusional NANB viral hepatitis and 50% of sporadic
NANB hepatitis. Although great progress has been made in
the selection of blood donors and in the immunological
characterisation of blood used for transfusions, there is
still a high number of HCV infections among recipients of
blood transfusions (one million or more infections every
year throughout the world). Approximately 50% of
HCV-infected individuals develop liver cirrhosis within a
period that can range from 5 to 40 years. Furthermore,
recent clinical studies suggest that there is a
correlation between chronic HCV infection and the
development of hepatocellular carcinoma.

HCV is an enveloped virus containing a positive-strand RNA
genome of approximately 9.4 kb. This virus is a member of
the Flaviviridae family, the other members of which are
the flaviviruses and the pestiviruses.

The RNA genome of HCV has recently been mapped.
Comparison of sequences from the HCV genomes isolated in
various parts of the world has shown that these sequences
can be extremely heterogeneous. The majority of the HCV
genome is occupied by an open reading frame (ORF) that
can vary between 9030 and 9099 nucleotides. This ORF
codes for a single viral polyprotein, the length of which
can vary from 3010 to 3033 amino acids. During the viral
infection cycle, the polyprotein is proteolytically
processed into the individual gene products necessary for
replication of the virus.

The genes coding for HCV structural proteins are
located at the 5'-end of the ORF, whereas the region
coding for the non-structural proteins occupies the rest
of the ORF.

The structural proteins consist of C (core, 21 kDa),
E1 (envelope, gp37) and E2 (NS1, gp61). C is a
non-glycosylated protein of 21 kDa which probably forms the
viral nucleocapsid. The protein E1 is a glycoprotein of
approximately 37 kDa, which is believed to be a
structural protein for the outer viral envelope. E2,
another membrane glycoprotein of 61 kDa, is probably a
second structural protein in the outer envelope of the
virus.

The non-structural region starts with NS2 (p24), a
hydrophobic protein of 24 kDa whose function is unknown.

NS3, a protein of 68 kDa which follows NS2 in the
polyprotein, is predicted to have two functional domains:
a serine protease domain within the first 200
amino-terminal amino acids, and an RNA-dependent ATPase
domain at the carboxy terminus.

The NS4 gene region codes for NS4A (p6) and NS4B
(p26), two hydrophobic proteins of 6 and 26 kDa,
respectively, whose functions have not yet been fully
clarified.

The NS5 gene region also codes for two proteins,
NS5A (p56) and NS5B (p65), of 56 and 65 kDa,
respectively. Amino acid sequences present in all the
RNA-dependent RNA polymerases can be recognised within
the NS5 region. This suggests that the NS5 region
contains components of the viral replication machinery.

Various molecular biological studies indicate that
the signal peptidase, a protease associated with the
endoplasmic reticulum of the host cell, is responsible
for proteolytic processing in the structural region,
that is to say at sites C/E1, E1/E2 and E2/NS2.

The serine protease in NS3 is responsible for
cleavage at the junctions between NS3 and NS4A, between
NS4A and NS4B, between NS4B and NS5A and between NS5A and
NS5B. In particular it has been found that the cleavage
made by this serine protease leaves a cysteine or a
threonine residue on the amino-terminal side (position P1)
and an alanine or serine residue on the carboxy-terminal
side (position P1') of the cleavage site. It has been
shown that the protease contained in NS3 is a
heterodimeric protein in vivo, forming a complex with the
protein NS4A. Formation of this complex increases
proteolytic activity on sites NS4A/NS4B and NS5A/NS5B,
and is a necessary requisite for proteolytic processing
of site NS4B/NS5A.

A second protease activity of HCV appears to be
responsible for the cleavage between NS2 and NS3. This
protease activity is contained in a region comprising
both part of NS2 and the portion of NS3 containing the
serine protease domain, but does not use the same
catalytic mechanism as the latter.

A substance capable of interfering with the
proteolytic activity associated with the protein NS3
might constitute a new therapeutic agent. In effect,
inhibition of this protease activity would involve
stopping the proteolytic processing of the non-structural
region of the HCV polyprotein and, consequently, would
prevent viral replication in the infected cells.

This sequence of events has been verified for the
homologous flavivirus, which, unlike HCV, infects cell
line cultures. In this case, it has been shown that
genetic manipulation generating a protease no longer
capable of carrying out its catalytic activity abolishes
the ability of the virus to replicate (1).

Furthermore, it has been widely shown, both in vitro
and in clinical studies, that compounds capable of
interfering with the HIV protease activity are capable of
inhibiting replication of this virus (2).

The methods used to generate molecules with
therapeutic potential are known to those operating in
this field. Generally speaking, collections of compounds
containing a large number of single chemical entities
with a high molecular diversity are subjected to an
automated assay in order to identify single active
agents, which then undergo further chemical modifications
in order to improve their therapeutic potential. Other
approaches may include rational modification of
substrates or ligands of a specific target protein, with
the aim of developing high binding affinity compounds
capable of altering or abolishing the biological activity
of the protein under examination. Determination of the
three-dimensional structure of a target protein, by means
of methods known in the field such as X-ray
crystallography or nuclear magnetic resonance (NMR),
allows rational design of molecules capable of binding
specifically to the protein and which, as a result of
this, have the ability to interfere with the biological
properties of that protein.

Research on compounds capable of interfering with
the biological activity of the protease contained in the
hepatitis C virus NS3 protein is hampered by the
difficulty in producing sufficient amounts of purified
protein with unaltered catalytic properties, and by the
need to use co-factors to enhance the activity of the
enzyme in vitro.

There is therefore a need in the specific field for
a process to produce NS3, or similar products, in larger
amounts than has been possible in the past, and with an
in vitro activity sufficient to select inhibitors.

The present invention consists of isolated and
purified polypeptides, with the proteolytic activity of
the HCV protein NS3, characterised by the fact that they
have an amino acid sequence chosen from among the
sequences SEQ ID NO:1, SEQ ID NO:2, SEQ ID NO:3, SEQ ID
NO:4 and SEQ ID NO:5.

The invention also comprises expression vectors to
produce the polypeptides represented by sequences SEQ ID
NO:1, SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4 and SEQ ID
NO:5, which have the proteolytic activity of HCV NS3,
comprising:

- a polynucleotide coding for one of said
polypeptides;

- functional regulation, transcription and
translation sequences in said host cell, operatively
linked to said polynucleotide coding for one of said
polypeptides; and

- optionally, a selectable marker.

The invention also extends to a host cell, either
eukaryotic or prokaryotic, transformed using an
expression vector containing a DNA sequence coding for
SEQ ID NO:1, SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4 and
SEQ ID NO:5 in such a way as to allow said host cell to
express the specific coded polypeptide in the chosen
sequence. The invention further comprises a process for
preparation of polypeptides with a sequence selected from
the group comprising SEQ ID NO:1, SEQ ID NO:2, SEQ ID
NO:3, SEQ ID NO:4 and SEQ ID NO:5, characterised by the
fact that it comprises, in combination, the following
operations:

- transformation of a host cell, either eukaryotic
or prokaryotic, using one of the expression vectors
mentioned above;

- expression of the desired nucleotide sequence to
produce the chosen polypeptide; and

- purification of the polypeptide thus obtained,
avoiding resolubilisation protocols.

The present invention also has as its object a
method for reproducing in vitro the proteolytic activity
of the HCV NS3 protease, characterised by the fact that
the activity of purified polypeptides, with sequences
chosen from the group comprising SEQ ID NO:1, SEQ ID
NO:2, SEQ ID NO:3, SEQ ID NO:4 and SEQ ID NO:5, similar
to NS3, is reproduced in a solution containing 30-70 mM
Tris pH 6.5-8.5, 3-30 mM dithiothreitol (DTT), 0.5-3%
3-[(3-cholamidopropyl)-dimethyl-ammonio]-1-propanesulphonate
(CHAPS) and 30-70% glycerol at
temperatures of between 20 and 25 °C, and by the fact that
in these conditions the activity of the above mentioned
polypeptides can be kinetically determined and quantified
on peptide substrates even in the absence of co-factors.
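
As an illustration only, the concentration and temperature ranges stated above can be written down as a small lookup table and used to check a proposed assay buffer. This is a minimal sketch: the dictionary keys and the function name are hypothetical, and the example buffer uses the values quoted in Example 1 for the activity measurement (50 mM Tris pH 7.5, 50% glycerol, 2% CHAPS, 30 mM DTT, 23 °C).

```python
# Ranges for the in vitro assay solution, copied from the text above.
# The key names are illustrative and not part of the original document.
ASSAY_RANGES = {
    "tris_mM": (30.0, 70.0),
    "pH": (6.5, 8.5),
    "dtt_mM": (3.0, 30.0),
    "chaps_percent": (0.5, 3.0),
    "glycerol_percent": (30.0, 70.0),
    "temperature_C": (20.0, 25.0),
}


def out_of_range(conditions):
    """Return the names of the parameters that fall outside the stated ranges."""
    problems = []
    for name, (low, high) in ASSAY_RANGES.items():
        if not (low <= conditions[name] <= high):
            problems.append(name)
    return problems


# Buffer used for the activity measurement in Example 1.
example_buffer = {
    "tris_mM": 50.0,
    "pH": 7.5,
    "dtt_mM": 30.0,
    "chaps_percent": 2.0,
    "glycerol_percent": 50.0,
    "temperature_C": 23.0,
}

print(out_of_range(example_buffer))  # prints []
```

A buffer held at 27 °C, for instance, would be reported as out of range for `temperature_C` only.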

An assay of the protease activity of the
polypeptides SEQ ID NO:1, SEQ ID NO:2, SEQ ID NO:3, SEQ
ID NO:4 and SEQ ID NO:5 can be performed by cleaving a
substrate providing detectable products. The cleavage is
preferably detected using methods based on radioactive,
colorimetric or fluorimetric signals. Methods such as
HPLC and the like are also suitable. According to the
present invention, the substrates used are synthetic
peptides corresponding to the HCV polyprotein NS4A/4B
junction. If necessary, peptides containing the amino
acid sequence SEQ ID NO:6, or parts thereof, can be used
as co-factor of the NS3 protease.

Peptides suitable for use as substrates are the
peptide represented by the sequence SEQ ID NO:7 and
derivatives thereof with N- and/or C-terminal deletions
(SEQ ID NOS:8-12, 14, 18-20) and the peptide represented
by the sequence SEQ ID NO:47. Particularly suitable are
the decapeptides represented by the sequences SEQ ID
NOS:18-20, especially SEQ ID NO:18 and the sequences
derived therefrom SEQ ID NOS:29-32, 35.

These peptides can be used for a high-throughput
assay of NS3 protease activity at a concentration of the
latter of between 100 and 200 nM.

According to the invention, depsipeptide substrates
(peptides with at least one ester bond in the sequence)
can also be used advantageously for a high-throughput
assay of the activity of the NS3 protease. It is, in
fact, known that it is desirable to run the assay at the
lowest possible enzyme concentration compatible with
sufficient substrate conversion. This maximises
sensitivity to inhibition and allows screening for
inhibitors which are present at very low concentrations
in compound mixtures or combinatorial libraries.
Substrates for NS3 protease with a standard amide at the
scissile bond between residues P1 and P1' have kcat/Km
values between 30 and 100 M⁻¹ s⁻¹. This sets a practical
range of enzyme concentration for a high-throughput assay
of 100-200 nM. To lower this concentration it is necessary
to use substrates with higher kcat/Km values. Substrates
containing an ester bond between P1 and P1' are ideally
suited for this, since formation of the acyl-enzyme
intermediate is accomplished much more readily due to the
more thermodynamically favourable transesterification
reaction (8). The depsipeptide substrates according to
the invention have very high kcat/Km values, and this
brings the useful range of NS3 concentration in the
high-throughput assay to 0.5-2 nM. These substrates may be
synthesised in high yield on solid phase by standard
chemical methodology.
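
The link between kcat/Km and the enzyme concentration needed in the assay can be sketched numerically. The example below is not taken from the original text: it assumes a substrate concentration well below Km, so that substrate decays pseudo-first-order with rate constant (kcat/Km)[E], and it assumes a hypothetical target of 10% conversion within one hour. The function names are illustrative.

```python
import math


def fraction_converted(kcat_over_km, enzyme_molar, seconds):
    """Fraction of substrate hydrolysed, assuming [S] << Km (pseudo-first-order decay)."""
    return 1.0 - math.exp(-kcat_over_km * enzyme_molar * seconds)


def enzyme_needed(kcat_over_km, fraction, seconds):
    """Enzyme concentration (M) that reaches the given conversion in the given time."""
    return -math.log(1.0 - fraction) / (kcat_over_km * seconds)


# Hypothetical assay target: 10% conversion within one hour (3600 s).
# 30 and 100 M^-1 s^-1 are the limits quoted above for amide substrates.
for kcat_over_km in (30.0, 100.0):
    nanomolar = enzyme_needed(kcat_over_km, 0.10, 3600.0) * 1e9
    print(f"kcat/Km = {kcat_over_km:.0f} M^-1 s^-1 -> about {nanomolar:.0f} nM enzyme")
```

Under these assumptions the required enzyme concentration is inversely proportional to kcat/Km, so a substrate with a 100-fold higher kcat/Km needs 100-fold less enzyme for the same conversion in the same time.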

Conventional assays are suitable for high-throughput
screening, but they require hydrolysis of at least 10% of
the substrate before the product can be detected
conveniently. This precludes determination of true
initial rates, which are important for accurate kinetic
studies. To overcome these difficulties, an assay has
been developed that allows continuous monitoring of
protease activity. The assay relies on specially tailored
synthetic substrates, which are capable of direct,
continuous signal generation that is directly
proportional to the extent of substrate hydrolysis, thus
avoiding the need for separation of the substrate from
the reaction product. The depsipeptides used (SEQ ID
NOS:45 and 46), the chemical formulas of which are given
in figure 12, are internally quenched fluorogenic
substrates based on resonance energy transfer (RET). They
contain a fluorescent donor,
5-[(2'-aminoethyl)amino]naphthalenesulfonic acid (EDANS),
near one end of the peptide, and an acceptor group,
4-[[4'-(dimethylamino)phenyl]azo]benzoic acid (DABCYL),
near the other end. The fluorescence of this type of
substrate is initially quenched by intramolecular RET
between the donor and the acceptor, but as the enzyme
cleaves the substrate the fluorescence increases. EDANS
and DABCYL were selected as donor/acceptor pair because of
the excellent spectral overlap between the fluorescent
emission of the former and the absorption of the latter
(13-17). RET efficiency depends on the distance between
the donor and the acceptor, i.e. the closer the two, the
higher the quenching. For the EDANS/DABCYL couple, the
Förster distance for 50% energy transfer (R0) is 33 Å.
The maximum distance between EDANS/DABCYL reported in a
substrate is 11 amino acids (Id) which, assuming an
extended conformation for the peptide, corresponds to
R = 39.8 Å, with a calculated RET efficiency of 24.5%. This
corresponds to a 10-fold increase in fluorescence upon
substrate cleavage.
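
The 24.5% figure quoted above is consistent with the standard Förster relation E = 1 / (1 + (R/R0)^6). A minimal sketch of that calculation follows; the function name is illustrative, and the two distances are the ones given in the text.

```python
def ret_efficiency(distance, forster_distance):
    """Forster energy-transfer efficiency for a donor/acceptor pair.

    Both distances must be in the same unit (here angstroms).
    """
    return 1.0 / (1.0 + (distance / forster_distance) ** 6)


# R = 39.8 angstroms and R0 = 33 angstroms, as stated in the text.
efficiency = ret_efficiency(39.8, 33.0)
print(f"{100 * efficiency:.1f}%")  # prints 24.5%
```

Because the efficiency falls with the sixth power of the distance, at R = R0 the function returns exactly 0.5, which is the definition of the Förster distance.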

Up to this point a general description has been
given of the present invention. With the aid of the
following examples, a more detailed description of
specific embodiments thereof will now be given, in order
to give a better understanding of the aims,
characteristics, advantages and operation methods of the
invention.

Figure 1 shows the plasmid vector used for transfer
and expression of the polypeptide represented by SEQ ID
NO:1 in Spodoptera frugiperda clone 9 cells.

Figures 2A and 2B show the plasmid vectors for
transfer and expression in E. coli of the polypeptides
represented by sequences SEQ ID NO:2, SEQ ID NO:3, SEQ ID
NO:4 and SEQ ID NO:5, respectively.

Figure 3 shows NS3 activity as a function of the
concentration of glycerol.

Figure 4 shows NS3 activity as a function of the
concentration of CHAPS,
3-[(3-cholamidopropyl)-dimethyl-ammonio]-1-propanesulphonate.

Figure 5 shows NS3 activity as a function of pH.

Figure 6 shows NS3 activity as a function of ionic
strength.

Figure 7 shows a diagram of the enzymatic assay to
measure NS3 activity using as a substrate a peptide
Ac-Asp-Glu-Met-Glu-Glu-Cys-Ala-Ser-His-Leu-Pro-Tyr-Lys-ε-(³H)-Ac
(SEQ ID NO:47).

Figure 8 shows the reaction diagram for synthesis of
the depsipeptide substrate S1 represented by the sequence
SEQ ID NO:42.

Figure 9 shows the reaction diagram for synthesis of
the depsipeptide substrate S2 represented by the sequence
SEQ ID NO:43.

Figure 10 shows the reaction diagram for synthesis
of the radioactive depsipeptide substrate S1 represented
by the sequence SEQ ID NO:44.

Figure 11 shows a high-throughput assay, based on
radioactive signals, to determine NS3 protease activity.

Figure 12 shows the chemical formula of the
depsipeptide substrates (SEQ ID NO:45 and SEQ ID NO:46)
for a continuous assay of NS3 activity based on RET
intramolecular fluorescence quenching.

Figure 13 shows the reaction diagram for synthesis
of the depsipeptide substrate S3 (SEQ ID NO:45).

Figures 14A and 14B show, respectively, the kinetic
parameters for the NS3 protease with the substrate S3
(SEQ ID NO:45) and fluorescence as a function of time in
the relevant assay.

EXAMPLE 1

Method of expression of HCV NS3 protease in Spodoptera
frugiperda clone 9 cultured cells.

Systems for expression of foreign genes in insect
cultured cells, such as Spodoptera frugiperda clone 9
(Sf9) cells infected with baculovirus vectors, are known
in the art (3). Heterologous genes are usually placed
under the control of the strong polyhedrin promoter of
the Autographa californica nuclear polyhedrosis virus or
the Bombyx mori nuclear polyhedrosis virus. Methods for
the introduction of heterologous DNA in the desired site
in the baculoviral vectors by homologous recombination
are also known in the art (4).

The plasmid vector pBacNS3 (1039-1226) is a
derivative of pBlueBacIII (Invitrogen) and was
constructed for transfer of a gene coding for a
polypeptide with the activity of NS3 (1039-1226). For
this purpose, the nucleotide sequence coding for this
polypeptide described in SEQ ID NO:1 was obtained by PCR
using oligonucleotides that insert an ATG codon at the 5'
end and a TAG stop codon at the 3' end of the sequence.
The fragment obtained in this way was inserted at the
BamHI site of the vector pBlueBacIII, following treatment
with the Klenow DNA polymerase fragment. The plasmid is
illustrated in figure 1.

Spodoptera frugiperda clone 9 (Sf9) cells and
baculovirus recombination kits were purchased from
Invitrogen. Cells were grown on dishes or in suspension
at 27°C in complete Grace's insect medium (Gibco)
containing 10% foetal bovine serum (Gibco). Transfection,
recombination, and selection of baculovirus constructs
were performed as recommended by the manufacturer.

For protein expression, Sf9 cells were infected with
the recombinant baculovirus at a density of 2 x 10^ cells
per ml in a ratio of about 5 virus particles per cell.
The cells were cultivated in suspension for 72 hours at
23°C. Lowering the temperature from 27°C, which
corresponds normally to the optimal growth temperature,
to 23°C is crucial in order to obtain a soluble and
active protein.

After harvesting the cells by centrifugation and
washing them with PBS (20 mM sodium phosphate pH 7.4, 140
mM NaCl), the pellet was re-suspended in 25 mM sodium
phosphate pH 6.5, 20% glycerol, 0.5%
3-[(3-cholamidopropyl)-dimethyl-ammonio]-1-propanesulphonate
(CHAPS), 10 mM dithiothreitol (DTT), 1 mM
ethylenediaminetetraacetic acid (EDTA). The cells were
disrupted at 4°C by means of four cycles of sonication at
10 W with a duration of 30 seconds each, using a Branson
250 instrument. The homogenate obtained in this way was
centrifuged at 120,000 x g for one hour and the
supernatant was loaded onto an HR26/10 S-Sepharose column
(Pharmacia) equilibrated with 25 mM sodium phosphate pH
6.5, 10% glycerol, 2 mM DTT, 1 mM EDTA, 0.1% CHAPS at a
flow rate of 2 ml/min. After washing with two column
volumes, the protease was eluted with an NaCl gradient
between 0 and 1 M. The fractions containing the protease
were identified using Western blotting methodology with
NS3-specific polyclonal antibodies, concentrated to 3 ml
using an Amicon ultrafiltration cell equipped with a YM10
membrane and chromatographed on a Superdex 75 HR26/60
column (Pharmacia) equilibrated with 50 mM sodium
phosphate pH 7.5, 10% glycerol, 2 mM DTT, 0.1% CHAPS, 1
mM EDTA at a flow rate of 1 ml/min. The fractions
containing the protease were pooled and underwent further
chromatography on a Mono-S HR5/5 column (Pharmacia)
equilibrated with the same buffer used in the previous
column. The protease was eluted in a pure form from this
column, applying a linear NaCl gradient between 0 and 0.5
M. The protease was stored at -80°C in 50% glycerol, 0.5%
CHAPS, 10 mM DTT and 50 mM sodium phosphate pH 7.5. The
yield of the process is 0.5 mg/l of cells. The purified
protein has a catalytic activity kcat/Km = 120-200 M⁻¹ s⁻¹
measured in 50 mM Tris pH 7.5, 50% glycerol, 2% CHAPS, 30
mM DTT at 23°C using the peptide substrate
Fmoc-Tyr-Gln-Glu-Phe-Asp-Glu-Met-Glu-Glu-Cys-Ala-Ser-His-Leu-Pro-Tyr-Ile-Glu-Gln-Gly
(SEQ ID NO:7), derived from the
polyprotein cleavage site between NS4A and NS4B. The
cleavage products deriving from this reaction were
separated using HPLC, isolated and identified by mass
spectrometry, confirming that proteolytic cleavage took
place between cysteine and alanine. The concentration of
protease necessary to determine activity was between 100
nM and 1.6 µM.

EXAMPLE 2 

Method of expression of HCV NS3 protease in E. coli.

The plasmids pT7-7 (NS3 1039-1226), pT7-7 (NS3
1039-1206), pT7-7 (NS3 1027-1206) and pT7-7 (NS3 1033-1206),
described in figures 2A and 2B, were constructed in order
to allow expression in E. coli of the polypeptides
indicated in SEQ ID NO:2 and SEQ ID NO:3, and SEQ ID NO:4
and SEQ ID NO:5, respectively. The protein fragments
contain variants of the protease domain of the HCV NS3
protein. The respective fragments of HCV cDNA were cloned
downstream of the bacteriophage T7 Φ10 promoter and in
frame with the first ATG codon of the phage T7 gene 10
protein, using methods that are known in the art.
The pT7-7 plasmids containing NS3 sequences also contain
the gene for the β-lactamase enzyme, which can be used as
a marker for selection of E. coli cells transformed with
these plasmids.

The plasmids were then transformed into the E. coli
strain BL21(DE3), which is normally employed for
high-level expression of genes cloned into expression
vectors containing the T7 promoter. In this strain of E.
coli, the T7 polymerase gene is carried on the
bacteriophage λ DE3, which is integrated into the
chromosome of BL21 cells (5). Expression from the gene of
interest is induced by addition of
isopropylthiogalactoside (IPTG) to the growth medium
according to a procedure that has been previously
described (5). Over 90% of the protein expressed using one
of the plasmids mentioned above is found in an insoluble
form in inclusion bodies, from which it is possible to
obtain a soluble and active protein following refolding
methods known in the field (see for example (6)).
Refolding protocols often have variable yields of
catalytically active protein, and they require extremely
controlled conditions, or cause irreversible
modifications of the protein (such as carbamylation in
the presence of urea), or require impractical procedures,
such as the use of extremely diluted protein solutions,
or dialysis of exceedingly large volumes of samples.

To avoid these problems, a method has been
developed, which is described below, for the production
of the HCV protease in a soluble and active form,
thus avoiding resolubilisation protocols. E. coli
BL21(DE3) cells transformed using one of the plasmids
mentioned above were grown at 37°C until reaching a cell
density that causes absorption of 0.8 OD (OD stands for
optical density) at 600 nm. At this point the temperature
was lowered to 30°C in 15-20 minutes and 400 µM IPTG was
added to induce expression of the protein. The
temperature was then lowered further to 22-24°C within a
period of 20-30 minutes. The cultures were stirred for a
further 4 hours at this temperature. At this point the
cells were harvested by centrifugation and washed using
PBS.

Purification method

The pellets resulting from the operations described
above were incubated on ice for 5 minutes and
re-suspended in 25 mM sodium phosphate pH 6.5, 50% glycerol,
0.5% CHAPS, 10 mM DTT, 1 mM EDTA (buffer A) pre-cooled to
4°C. 10 ml of this buffer was used for each litre of
bacterial culture. After a further 5-10 minutes of
incubation on ice the cell suspension was homogenised
using a French press. The resulting homogenate was
centrifuged at 120,000 x g. The supernatants from this
centrifugation were kept on ice, whereas the pellets
were re-suspended in buffer A (1 ml to each litre of
bacterial culture). After the addition of 1 mM MgCl2 and
DNase I, the suspension was incubated for 10 minutes at
20°C and re-centrifuged for 1 hour at 120,000 x g. The
supernatant from this second centrifugation was pooled
with the first supernatant and the resulting protein
solution was adsorbed on S-Sepharose (or SP-Sepharose)
resin (Pharmacia) equilibrated with 25 mM sodium
phosphate pH 6.5, 10% glycerol, 0.5% CHAPS, 3 mM DTT, 1
mM EDTA (buffer B). 10 ml of resin suspended in 5 ml of
buffer B was used for each litre of bacterial culture.
The resin was stirred for 1 hour at 4°C, collected by
filtration, washed with buffer B and poured into an
appropriate chromatography column. The protease was
eluted with an NaCl gradient between 0 and 1 M. Fractions
containing the protease were identified using Western
blotting, pooled and concentrated using Centriprep 10
concentrators (Amicon) until reaching a concentration of
6-10 mg/ml in protein, determined using the Bio-Rad
method. Up to 3 ml of this solution was loaded onto an HR
26/60 Superdex 75 or up to 20 ml was loaded onto an HR
60/600 Superdex 75 (both Pharmacia) equilibrated with 50
mM sodium phosphate pH 7.5, 10% glycerol, 3 mM DTT, 0.5%
CHAPS (buffer C) and chromatography was carried out at 1
ml/min (HR26/60) or 5 ml/min (HR60/600). The fractions
containing the protease were pooled and further purified
by chromatography on HR 5/5 Mono S (Pharmacia)
equilibrated with buffer C. The protease was eluted from
this column with an NaCl gradient between 0 and 0.5 M.
Purification to homogeneity was also possible with the
following modification: after elution from S-Sepharose
the fractions containing the protease were diluted 1:4 in
buffer C and loaded onto Heparin-Sepharose. Elution from
this resin was obtained with an NaCl gradient between 0
and 0.5 M. The protein was then chromatographed on
hydroxyapatite or Superdex 75 as described above. The
yield is 1-2 mg of purified protein per litre of
bacterial culture.
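
Several quantities in the purification above are quoted per litre of bacterial culture, so they can be scaled with simple arithmetic. The helper below is only an illustrative sketch: the function and key names are hypothetical, and it assumes the per-litre figures scale linearly with culture volume, which the text does not state explicitly.

```python
def scale_purification(culture_litres):
    """Scale the per-litre quantities quoted in the text to a given culture volume.

    Assumes linear scaling; all names are illustrative only.
    """
    return {
        # 10 ml of buffer A per litre for the first re-suspension
        "buffer_A_resuspension_ml": 10.0 * culture_litres,
        # 1 ml of buffer A per litre to re-suspend the first pellets
        "buffer_A_pellet_ml": 1.0 * culture_litres,
        # 10 ml of resin suspended in 5 ml of buffer B per litre
        "s_sepharose_resin_ml": 10.0 * culture_litres,
        "buffer_B_for_resin_ml": 5.0 * culture_litres,
        # stated yield of 1-2 mg purified protein per litre
        "expected_yield_mg": (1.0 * culture_litres, 2.0 * culture_litres),
    }


plan = scale_purification(4.0)
for name, value in plan.items():
    print(name, value)
```

For a 4-litre culture this gives 40 ml of buffer A for the first re-suspension and an expected yield of 4 to 8 mg of purified protein.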

Characterisation of the purified protein

The purified protein was characterised by means of 
gel filtration, reverse-phase HPLC, mass spectrometry and 
N-terminal sequence analysis. 

Analytical gel filtration experiments showed that
the protein is monomeric. The protein expressed using
pT7-7 (NS3 1027-1206) shows three peaks following
reverse-phase HPLC chromatography. Mass spectrometry
analysis and determination of the N-terminal sequence
showed heterogeneity of the N-terminal portion of the
molecule. Three forms were found, having the following
N-terminal sequences:

Met-Ala-Pro-Ile-Thr-Ala-Tyr-Ser-Gln-Gln-Thr (form 1)
Pro-Ile-Thr-Ala-Tyr-Ser-Gln-Gln-Thr (form 2)
Ser-Gln-Gln-Thr (form 3)

To avoid this problem, two experimental strategies
were adopted:

1. Homogenisation in the presence of 100 µg/ml of the
protease inhibitor chymostatin. This inhibitor does not
inhibit HCV protease activity, but it does inhibit the
chymotrypsin-type proteases, specific for aromatic
residues like phenylalanine and tyrosine. In this way it
was possible to purify a single molecular species with
more than 95% of form 2.

2. Production of a protease corresponding to form 3 by
means of the plasmid pT7-7 (NS3 1033-1206). In this way a
protein with more than 95% of form 3 was purified.

EXAMPLE 3

Method for reproducing in vitro the activity of the HCV
NS3 protease

Definition of the chemical and physical conditions
for reproduction of the activity.

The ability of the purified protease to catalyse
cleavage of the peptide
Fmoc-Tyr-Gln-Glu-Phe-Asp-Glu-Met-Glu-Glu-Cys-Ala-Ser-His-Leu-Pro-Tyr-Ile-Glu-Gln-Gly
(SEQ ID NO:7) has been used to define the optimum conditions
for activity. Cleavage was detected by separating the
substrate from the hydrolysis products by reverse-phase
HPLC. For this purpose the mixture containing the buffer
and the peptide incubated with the protease was injected
into a reverse-phase Lichrospher RP-18 column (Merck) and
eluted with an acetonitrile gradient containing 0.1%
trifluoroacetic acid. The cleavage products were
identified by co-injection of appropriate standards, and
by mass spectrometry. For these experiments, proteins
produced by one of the methods described in examples 1
and 2 were used.

Dependence of the activity on the glycerol
concentration was determined in a buffer containing 50 mM
Tris pH 7.5, 2% CHAPS, 30 mM DTT. Increasing
concentrations of glycerol were added to this buffer, and
the relative protease activity was determined. Figure 3
shows the results of this experiment, indicating that
50% (v/v) glycerol is the optimum level. In a subsequent
experiment this concentration was kept constant at 50%
and the concentration of CHAPS was varied (figure 4). A
level of 2% CHAPS (w/v) was in this way found to be the
optimum concentration. It was possible to replace CHAPS
with other detergents compatible with the need to
maintain catalytic activity in the polypeptides according
to the invention. Some of these detergents are:
heptyl-β-D-glucopyranoside, decyl-β-D-glucopyranoside,
decyl-β-D-glucomaltoside, nonyl-β-D-glucopyranoside,
n-hexyl-β-D-glucopyranoside, octyl-β-D-glucopyranoside,
octyl-β-D-thioglucopyranoside, Nonidet P-40, Tween-20.

At optimum CHAPS and glycerol concentrations the
protease shows optimal activity at pH 8.5 (figure 5). At
this pH the stability over time is, however, lower than
that seen at pH 7.5. To determine the effect of ionic
strength on the activity, a titration was performed using
NaCl. This experiment showed that protease activity is
inhibited at high ionic strength (figure 9). Kinetic
analysis of the data showed that chloride ions are
competitive inhibitors at concentrations of up to 100 mM.

It was thus possible to define the following optimal
conditions for in vitro assay of purified HCV protease
activity: 50 mM Tris pH 7.5, 3-30 mM DTT, 2% CHAPS, 50%
glycerol. Dependence of the activity on temperature was
analysed by means of an Arrhenius plot, in which the
logarithm of the kinetic constant kcat is plotted against
the inverse of the temperature. This graph shows a
discontinuity at temperatures above 25 °C, indicating
changes in conformation that accompany the decrease in
activity. The optimum temperature was thus determined to
be around 22-23 °C.
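
An Arrhenius analysis of this kind can be sketched as follows. This is a minimal illustration only: the function name and the temperature and kcat values are hypothetical placeholders generated from an assumed activation energy, not data from these experiments. The function fits ln(kcat) against 1/T by least squares and returns the apparent activation energy.

```python
import math

R = 8.314  # gas constant, J mol^-1 K^-1

def arrhenius_fit(temps_c, kcats):
    """Least-squares fit of ln(kcat) versus 1/T (T in kelvin).

    Returns (apparent activation energy in J/mol, ln of the pre-exponential factor).
    """
    x = [1.0 / (t + 273.15) for t in temps_c]
    y = [math.log(k) for k in kcats]
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    slope = sxy / sxx              # slope of the Arrhenius plot is -Ea/R
    intercept = mean_y - slope * mean_x
    return -slope * R, intercept

# Hypothetical values generated from an assumed Ea of 50 kJ/mol (illustration only).
temps = [10.0, 15.0, 20.0, 25.0]
kcats = [math.exp(20.0 - 50000.0 / (R * (t + 273.15))) for t in temps]
ea, ln_a = arrhenius_fit(temps, kcats)
```

With real measurements, a discontinuity such as the one reported above 25 °C would appear as points that fall systematically off the line fitted to the lower temperatures.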

As mentioned above, the protein NS4A is a cofactor
of HCV protease. N- and C-terminal deletion experiments
have defined the peptide Pep4A, with the sequence
indicated in SEQ ID NO: 6, as the minimum domain still
capable of inducing optimal activation. In transfection
or in vitro translation experiments the addition of
polypeptides containing the minimum NS4A sequence is
essential to give effective cleavage. The addition of
Pep4A is capable of inducing a significant increase in
the activity of purified protease in the assay conditions
described above. The kinetic characteristics of this
activation are described below. Using a titration
experiment, a stoichiometry of 1:1 was determined for this
interaction at a protease concentration of 300 nM,
indicating a Kd < 300 nM.

Definition of the optimal substrate for activity
assay

To define the minimum substrate whose cleavage can
still be detected using the HPLC method described above,
derivatives of the peptide Fmoc-Tyr-Gln-Glu-Phe-Asp-Glu-
Met-Glu-Glu-Cys-Ala-Ser-His-Leu-Pro-Tyr-Ile-Glu-Gln-Gly
(SEQ ID NO: 7) described above were synthesised with N-
and/or C-terminal deletions. These peptides were
incubated in the conditions defined in the preceding
section in the presence of 100 nM-1.6 µM protease. The
nomenclature adopted below for the amino acid residues of
the substrate peptides is that set down by Schechter and
Berger in (7). The residues are defined as
Pn ... P3, P2, P1, P1', P2', P3' ... Pn', where the
hydrolysed bond is P1-P1' (the bond between Cys and Ala).
Table 1 shows the kinetic data for this experiment,
defining P6 and P3' or P4' as the extreme limits of a
substrate that is still effectively cleaved. Deletions
beyond P6 or P3' cause a drastic decrease in the
efficiency, measured as kcat/Km, with which the respective
peptide can still act as a substrate. Deletion of P4'
causes a less marked decrease in kcat/Km; however, the
separation of substrate and cleavage product by HPLC is
significantly better for a decapeptide P6-P4' than for a
nonapeptide P6-P3', so the decapeptide P6-P4' has been
defined as the optimal substrate.
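
The Schechter and Berger numbering can be illustrated with a short sketch. The function name is hypothetical and introduced here only for illustration; the example labels the decapeptide DEMEECASHL (the sequence of SEQ ID NO:18 without its acetyl cap), taking Cys as P1.

```python
def schechter_berger_labels(sequence, p1_index):
    """Label residues Pn..P1, P1'..Pn' around the scissile bond.

    sequence: one-letter peptide sequence
    p1_index: 0-based index of the P1 residue (the residue just before the cleaved bond)
    """
    labels = []
    for i, residue in enumerate(sequence):
        if i <= p1_index:
            # Non-primed side: numbering increases towards the N-terminus.
            labels.append((f"P{p1_index - i + 1}", residue))
        else:
            # Primed side: numbering increases towards the C-terminus.
            labels.append((f"P{i - p1_index}'", residue))
    return labels

decapeptide = "DEMEECASHL"
labels = schechter_berger_labels(decapeptide, decapeptide.index("C"))
for name, residue in labels:
    print(name, residue)
```

For this peptide the labels run from P6 (Asp) to P4' (Leu), with the hydrolysed bond between P1 (Cys) and P1' (Ala).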



Table 1: Characterisation of substrate

Peptide                                      Km       kcat      kcat/Km
                                             (µM)     (min⁻¹)   (M⁻¹ s⁻¹)

(SEQ ID NO:7)  Fmoc-YQEFDEMEECASHLPYIEQG     53       0.5       143.0
(SEQ ID NO:8)  Ac-YQEFDEMEECASHLPY           56       0.3       87.0
(SEQ ID NO:9)  Ac-YQEFDEMEECASHLP            95       0.4       70.2
(SEQ ID NO:10) Ac-YQEFDEMEECASHL             117      0.4       51.0
(SEQ ID NO:11) Ac-YQEFDEMEECASH              197      0.3       24.0
(SEQ ID NO:12) Ac-YQEFDEMEECAS               >1500              11.1
(SEQ ID NO:13) Ac-YQEFDEMEECA                no cleavage
(SEQ ID NO:14) Ac-DEMEECASHLPY               171      0.3       34.0
(SEQ ID NO:15) Ac-EMEECASHLP                 3137     0.3       2.0
(SEQ ID NO:16) Ac-MEECASHL                   no cleavage
(SEQ ID NO:17) Ac-ECASHLPYIEQG               no cleavage
(SEQ ID NO:18) Ac-DEMEECASHL                 100      0.3       47
(SEQ ID NO:19) DEMEECASHL                    85       0.1       22.7
(SEQ ID NO:20) Fmoc-DEMEECASHL               95       0.1       23.8

The kinetic parameters Km, kcat and kcat/Km were
determined for decapeptides P6-P4' corresponding to the
other two intermolecular cleavage sites, NS4B/5A and
NS5A/5B, and these data were compared with the data obtained
using the peptide P6-P4' corresponding to the site
NS4A/4B (table 2). These kinetics were obtained both in
the absence and in the presence of stoichiometric
concentrations of Pep4A. Analysis of the kinetic data
obtained in this fashion indicates that Pep4A predominantly
affects kcat. When the Km values for the individual
substrates are compared, it becomes evident that the
presence of two negative charges in P5 and P6 determines
the binding efficiency of a peptide substrate. In fact,
decapeptides corresponding to the sites NS4A/4B and
NS5A/5B, with Asp or Glu residues in positions P6 and P5,
have Km values that are similar to each other and
significantly lower than that of the peptide corresponding
to site NS4B/5A, which has a single charge in position P6.

TABLE 2: Activity on peptides corresponding to cleavage
sites in trans

Peptide                                Km      kcat      kcat/Km
                                       (µM)    (min⁻¹)   (M⁻¹ s⁻¹)

NS4A/4B
(SEQ ID NO:18) Ac-DEMEECASHL           100     0.3       47.0
(SEQ ID NO:6)  + Pep4A                 43      1.4       540

NS4B/5A
(SEQ ID NO:21) Ac-DCSTPCSGSW           2100    0.05      0.4
(SEQ ID NO:6)  + Pep4A                 320     0.8       4.2

NS5A/NS5B
(SEQ ID NO:22) Ac-EDVVCCSMSY           310     4.2       220
(SEQ ID NO:6)  + Pep4A                 380     15        650
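
The relative effect of Pep4A on kcat and on Km can be made explicit with a small calculation. The sketch below is illustrative only: the dictionary and function names are hypothetical, and the Km and kcat values are those listed in Table 2.

```python
# (Km in µM, kcat in min^-1) without and with Pep4A, as listed in Table 2.
TABLE_2 = {
    "NS4A/4B": {"alone": (100.0, 0.3), "with_pep4a": (43.0, 1.4)},
    "NS4B/5A": {"alone": (2100.0, 0.05), "with_pep4a": (320.0, 0.8)},
    "NS5A/5B": {"alone": (310.0, 4.2), "with_pep4a": (380.0, 15.0)},
}

def pep4a_effect(site):
    """Return the fold increase in kcat and the fold decrease in Km caused by Pep4A."""
    km_alone, kcat_alone = TABLE_2[site]["alone"]
    km_pep, kcat_pep = TABLE_2[site]["with_pep4a"]
    return {
        "kcat_fold": kcat_pep / kcat_alone,  # values > 1 mean faster turnover
        "km_fold": km_alone / km_pep,        # values > 1 mean tighter apparent binding
    }

effects = {site: pep4a_effect(site) for site in TABLE_2}
for site, effect in effects.items():
    print(site, round(effect["kcat_fold"], 1), round(effect["km_fold"], 1))
```

For all three sites the fold change in kcat exceeds the fold change in Km, which is the sense in which Pep4A mainly affects kcat.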
Further investigation was carried out on the
relative importance of single residues within the
sequence P6-P4', corresponding to the cleavage site
NS4A/4B, by mutating each amino acid singly to alanine
and then determining the kinetic parameters for the
mutant peptides obtained in this way. The results are
described in table 3. This experiment identifies the
following scale of importance of single residues for
effective cleavage: P1 >> P3 = P5 = P6 > P2 = P4. Modification
of the P' part does not have a significant effect on the
rate of cleavage. This information was used to develop
protease activity assay methods, useful for the
identification of inhibitors. These methods will be
described below.

TABLE 3. Replacement with alanine of residues P6-P4' of
the peptide substrate

Peptide                            Km       kcat      kcat/Km
                                   (µM)     (min⁻¹)   (M⁻¹ s⁻¹)

(SEQ ID NO:18) Ac-DEMEECASHL       100      0.3       47.0
(SEQ ID NO:23) Ac-AEMEECASHL       150      0.1       9.4
(SEQ ID NO:24) Ac-DAMEECASHL       527      0.3       9.3
(SEQ ID NO:25) Ac-DEAEECASHL       114      0.1       18.1
(SEQ ID NO:26) Ac-DEMAECASHL       322      0.1       7.2
(SEQ ID NO:27) Ac-DEMEACASHL       132      0.1       18.4
(SEQ ID NO:28) Ac-DEMEEAASHL       no cleavage
(SEQ ID NO:29) Ac-DEMEECAAHL       129      0.2       32.5
(SEQ ID NO:30) Ac-DEMEECASAL       180      0.3       33.4
(SEQ ID NO:31) Ac-DEMEECASHA       94       0.1       23.2

For more detailed determination of the importance of
the residues in P6 and P1', a series of peptides P6-P4'
were synthesised in which modifications were introduced
in these positions. The results of these experiments,
described in table 4, underline the importance of a
negative charge in position P6. In fact, Asp or Glu in this
position are accepted with indistinguishable Km.
Neutralisation of the charge by introduction of Asn causes
a significant increase in Km, whereas inversion of the
charge by introduction of a Lys residue causes an extremely
marked increase in Km.

TABLE 4. Substitution of residues P6 and P1' in the
peptide substrate

Peptide                            Km       kcat      kcat/Km
                                   (µM)     (min⁻¹)   (M⁻¹ s⁻¹)

(SEQ ID NO:18) Ac-DEMEECASHL       100      0.3       47.0
(SEQ ID NO:32) Ac-EEMEECASHL       85       0.2       32.0
(SEQ ID NO:33) Ac-NEMEECASHL       427      0.2       7.7
(SEQ ID NO:34) Ac-KEMEECASHL       >1000              3.1
(SEQ ID NO:35) Ac-DEMEECSSHL                          27.2
(SEQ ID NO:36) Ac-DEMEECFSHL                          1.1

Substitution of Ala in position P1' with Ser has no
significant effect, whereas substitution with Phe causes
a reduction in the cleavage rate of the resulting
substrate, measured as kcat/Km.

Analysis was carried out on a series of mutations of
the position P1, described in table 5. Substitutions of
cysteine in this position with threonine, allylglycine, α-
aminobutyric acid, norvaline and valine are accepted,
even though the resulting substrates are cleaved with an
efficiency, expressed as kcat/Km, which is significantly
lower than that of the unmodified substrate.

TABLE 5. Substitution of the peptide substrate residue P1

Peptide substrate                    kcat/Km
                                     (M⁻¹ s⁻¹)

(SEQ ID NO:18) Ac-DEMEECASHL         47.0
(SEQ ID NO:37) Ac-DEMEEAlgASHL       4.3
(SEQ ID NO:38) Ac-DEMEEAbuASHL       1.2
(SEQ ID NO:39) Ac-DEMEETASHL         0.6
(SEQ ID NO:40) Ac-DEMEENvaASHL       0.08
(SEQ ID NO:41) Ac-DEMEEVASHL         0.05

Alg, allylglycine; Abu, α-aminobutyric acid; Nva,
norvaline

The information relating to substrate specificity
can be used both for development of enzyme assays and for
synthesis of inhibitors based on modified substrate
sequences. For example, substrate peptides with modified
P1 residues are competitive inhibitors of the protease with
inhibition constants Ki of between 350 and 90 nM (table
6). These peptides can be further modified to increase
their inhibitory power by introduction of aldehyde,
trifluoromethylketone, difluoromethyleneketone, diketone,
ketoester, ketoamide or α-ketoheterocyclic, boronic acid
and monohalomethylketone groups. Information on
specificity can also allow synthesis of inhibitors that
are not based on peptides, such as: halo-enol lactones,
isocoumarins, β-lactams, succinimides, pyrones,
benzoxazinones, benzoisothiazolines or latent
isocyanates.

TABLE 6. Inhibitory action of decapeptides P6-P4'
modified at position P1

Residue P1     Ki         Km
               (µM)

Cys                       90
Abu            175        189
Alg            165        179
Thr            215        180
Val            173        not determined
Ala            173        no cleavage
Ser            90         no cleavage
Gly            191        no cleavage
Pro            440        no cleavage
Cha            350        no cleavage

Abu, α-aminobutyric acid; Alg, allylglycine; Cha,
cyclohexylalanine.

Method for using in vitro protease activity for inhibitor
research

Automatic assay using an amide substrate

The peptide Ac-Asp-Glu-Met-Glu-Glu-Cys-Ala-Ser-His-
Leu-Pro-Tyr-Lys-ε-(³H)Ac (SEQ ID NO:47), derived from the
cleavage site NS4A/NS4B, is cleaved by the NS3 protease
with the following kinetic parameters: Km = 79 µM, kcat =
0.49 min⁻¹ and kcat/Km = 103 M⁻¹ s⁻¹. 400,000 cpm of the
labelled peptide, with a specific activity of 2-10
Ci/mmol, were incubated for 3 hours at 23 °C together with
40 µM (Km/2) of unlabelled peptide in the presence of 200
nM protease and 1 of Pep4A in 50 mM Tris pH 7.5, 50%
glycerol, 3% CHAPS, 10 mM DTT. During this period 20% of
the peptide substrate was cleaved. The cleavage product
can be quantified following the method described below
and summarised in figure 7. As can be seen from the
figure, the mixture is placed in contact with a TSK-DEAE
anion exchanger. The fraction coming out of the
exchanger is filtered, allowed to sediment or spun. The
radioactivity is measured in the clear fraction, and its
amount is related exclusively to the right-hand
(C-terminal) fragment, given that the amide substrate and
the left-hand fragment remain bound to the anion
exchanger. The addition of inhibitors causes a decrease
in the release rate of the labelled cleaved fragment. The
more effective the inhibitor, the lower will be the
radioactivity measured in the fraction coming out of the
anion exchanger.
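
The readout of this assay can be reduced to two simple quantities, sketched below. The function names and the counts in the example are hypothetical; only the 400,000 cpm input and the 20% cleavage of the uninhibited reaction are taken from the text, and the optional blank correction is an assumption rather than a step described here.

```python
def fraction_cleaved(cpm_unbound, cpm_total, cpm_blank=0.0):
    """Fraction of labelled substrate recovered in the fraction not retained by the exchanger."""
    return (cpm_unbound - cpm_blank) / (cpm_total - cpm_blank)

def percent_inhibition(cpm_with_inhibitor, cpm_no_inhibitor, cpm_blank=0.0):
    """Percent inhibition relative to an uninhibited control reaction."""
    signal = cpm_with_inhibitor - cpm_blank
    control = cpm_no_inhibitor - cpm_blank
    return 100.0 * (1.0 - signal / control)

# Uninhibited control: 20% of 400,000 cpm released into the unbound fraction.
control_cpm = 80_000.0
# Hypothetical well containing an inhibitor.
inhibited_cpm = 20_000.0

control_fraction = fraction_cleaved(control_cpm, 400_000.0)
inhibition = percent_inhibition(inhibited_cpm, control_cpm)
```

In this hypothetical case the inhibitor reduces the released radioactivity to a quarter of the control, corresponding to 75% inhibition.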

EXAMPLE 5

Synthesis of the depsipeptide substrate S1:
Ac-Asp-Glu-Met-Glu-Glu-Abu-ψ[COO]-Ala-Ser-His-Leu-Pro-
Tyr-Lys(Nε-Ac)-NH2 (SEQ ID NO: [illegible])

The synthesis was performed entirely on solid phase
using the continuous-flow Fmoc-polyamide method (9). The
protecting group combination was: base-labile Nα-Fmoc for
the α-amino group and acid-labile protection for the
side chains: Asp(Ot-Bu), Glu(Ot-Bu), Tyr(t-Bu) and
His(Trt). The polymer used was composite Kieselguhr-
polyamide (9) derivatised with a modified Rink amide
linker (10), p-[(R,S)-α-[1-(9H-fluoren-9-yl)-
methoxyformamido]-2,4-dimethoxybenzyl]-phenoxyacetic acid
(11) (NovaSyn® KR 125, 0.1 mmol/g). The resin, amino
acid derivatives, activating agents and all other
reagents were of the highest available grade from
commercial sources. The synthesis was run according to
the scheme given in figure 8. Couplings were performed
with a 5-fold excess of activated amino acid over the resin
free amino groups, using Fmoc-amino acid/PyBOP/HOBt/DIEA
(1:1:1:2) activation, except for L-(+)-lactic acid, where
Fmoc-amino acid/DIPC/HOBt (1:1:1:1) activation was used.
Esterification of Abu to the free hydroxyl of lactic acid
was performed using the symmetrical anhydride (Fmoc-
Abu)2O in the presence of a catalytic amount (0.1 equiv.)
of DMAP, for 30 minutes at room temperature (12): the
reaction was repeated twice to achieve 90% yield; in the
absence of catalyst, the remaining free hydroxyls are
unreactive in subsequent synthetic operations. At the end
of the assembly, the resin was washed with DMF, methanol
and CH2Cl2, then dried in vacuo for 16 hours. The dry
peptide-resin was treated with TFA/water/
triisopropylsilane (92.5:5:2.5) for 1.5 hours at room
temperature; the resin was filtered out and the peptide
precipitated with cold methyl t-Bu ether; the precipitate
was redissolved in 50% water/acetonitrile containing 0.1%
TFA and lyophilised.

Purification to >98% homogeneity was achieved
through preparative HPLC on a Nucleosyl C-18 column
(250x21 mm, 7 µm) using as eluents (A) water and (B)
acetonitrile with 0.1% TFA, and a step gradient: 22% B over
5 minutes, then 22-27% B over 25 minutes, flow rate 12
ml/min. In these conditions the peptide elutes at 21.9
minutes. The fractions containing the pure material were
pooled and lyophilised: yield 35%.

EXAMPLE 6

Chemical synthesis of the depsipeptide substrate S2:
Ac-Asp-Glu-Met-Glu-Glu-Thr-ψ[COO]-Ala-Ser-His-Leu-Pro-

The synthesis was performed as described in the
previous example. Esterification of Thr to lactic acid
required three repetitions to obtain a 70% yield, which
was also accompanied by 3% racemisation of the Thr
residue. The D-Thr diastereoisomer was, however,
chromatographically well resolved from the L-isomer, and
easily resolved by preparative HPLC. The gradient used
was 21% B over 5 minutes, then 21-22% B over 20 minutes,
with the desired peptide eluting at 19.7 minutes: yield
24%.

EXAMPLE 7 

Synthesis of the radioactive depsipeptide substrate S1:
Ac-Asp-Glu-Met-Glu-Glu-Abu-ψ[COO]-Ala-Ser-His-Leu-Pro-

To selectively label peptide S1 on the Nε-amino
group of the C-terminal lysine, the protected precursor
Ac-Asp(Ot-Bu)-Glu(Ot-Bu)-Met-Glu(Ot-Bu)-Glu(Ot-Bu)-Abu-
ψ[COO]-Ala-Ser(t-Bu)-His(Trt)-Leu-Pro-Tyr(t-Bu)-Lys-CONH2
was assembled on the resin according to the scheme of
figure 10. The only variation with respect to the
synthesis of (Nε-Ac)-S1 was the use of Fmoc-Lys(Alloc)-OH
instead of Fmoc-Lys(Nε-Ac)-OH. The Alloc protection is
orthogonal with respect to Fmoc and t-Bu based protecting
groups, being removed by a two-hour treatment with
Pd(0)[PPh3]4 in a solution of CHCl3 containing 5% acetic
acid and 2.5% N-methylmorpholine.

The dry peptide-resin (0.07 mmol/g, 60 mg) was
reacted with [³H]acetic anhydride (25 mCi, 5.7 mCi/mmol)
for 16 hours at room temperature. A 10-fold excess of
non-radioactive acetic anhydride was then used to
complete the reaction. The resin was then washed with DMF
and treated as previously described. After preparative
HPLC, >98% pure peptide Ac-Asp-Glu-Met-Glu-Glu-Abu-ψ-
[COO]-Ala-Ser-His-Leu-Pro-Tyr-Lys(Nε-[³H]-CH3CO)-NH2 was
obtained with a specific activity of 0.68 mCi/mmol.
Using the HPLC-based assay, the following kinetic
parameters were obtained for the radioactive depsipeptide
substrate S1 (SEQ ID NO: 44):

Km (µM) = 11
kcat/Km (M⁻¹ s⁻¹) = 13636

Using the same assay, the kinetic parameters for the
radioactive substrate S2 are:

kcat (min⁻¹) = 16
Km (µM) = 96
kcat/Km (M⁻¹ s⁻¹) = 2780

Synthesis of the radioactive depsipeptide substrates
allows set-up of a high-throughput assay for
determination of NS3 protease activity, as schematically
illustrated in figure 11. The principle is the following:
both the intact substrate and the N-terminal fragment
that originates from enzyme cleavage (Ac-Asp-Glu-Met-Glu-
Glu-Abu-OH) are strongly acidic, whereas the C-terminal
fragment [HO-CH(CH3)CO-Ser-His-Leu-Pro-Tyr-Lys(Nε-[³H]-
CH3CO)-NH2] is, according to pH, neutral or basic. It is
therefore possible to capture the two acidic species on
an anion exchange resin, leaving the C-terminal
fragment in solution. If the C-terminal fragment contains
a radioactive marker (in this case the tritiated acetate
covalently bonded to the ε-amino group of the C-terminal
lysine), the resin will be able to discriminate processed
substrate from non-processed substrate, thus making it
possible to quantify proteolytic activity by measuring
the amount of radioactivity remaining in solution after
incubation with the enzyme and treatment with the ion
exchanger. The whole process is essentially the same as
that used in the high-throughput assay based on the amide
substrate of example 4, but the pH used in this case is 7.0
instead of 7.5, to minimise spontaneous hydrolysis of the
ester bond (0.6%/hour at 23 °C).
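
The contribution of spontaneous ester hydrolysis to the measured signal can be estimated with a short calculation. This is a sketch under a stated assumption: the 0.6%/hour figure is treated as a constant fractional loss per hour, which is not specified in the text, and both function names are hypothetical.

```python
def spontaneous_fraction(hours, rate_per_hour=0.006):
    """Fraction of ester substrate hydrolysed without enzyme after `hours`.

    Assumes a constant fractional loss of 0.6% per hour (the value given for 23 °C).
    """
    return 1.0 - (1.0 - rate_per_hour) ** hours

def enzymatic_fraction(observed_fraction, hours, rate_per_hour=0.006):
    """Observed cleaved fraction minus the estimated no-enzyme background."""
    return observed_fraction - spontaneous_fraction(hours, rate_per_hour)

background_3h = spontaneous_fraction(3.0)
# Hypothetical observation: 20% of the substrate found cleaved after 3 hours.
corrected = enzymatic_fraction(0.20, 3.0)
```

In practice a no-enzyme control run in parallel would give the background directly; the calculation only indicates its expected size, a little under 2% for a 3-hour incubation.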

EXAMPLE 8

Synthesis of the depsipeptide substrates S3 and S4:
Ac-Asp-Glu-Asp(EDANS)-Glu-Glu-Abu-ψ[COO]-Ala-Ser-Lys-
(DABCYL)-NH2 (SEQ ID NO:45) and Ac-Asp-Asp(EDANS)-Met-
Glu-Glu-Abu-ψ[COO]-Ala-Ser-Lys(DABCYL)-NH2 (SEQ ID NO:46)

The chemical formula of the two substrates S3 and S4
is shown in figure 11.

The synthesis was performed on solid phase as
detailed in the scheme of figure 13 for S3 (SEQ ID
NO: 45), making use of two special derivatives, Fmoc-
Asp(EDANS)-OH and Fmoc-Lys(DABCYL)-OH, prepared according
to known methods (16-17). All the couplings, including
Asp(EDANS) and Lys(DABCYL), were performed with a 5-fold
excess of activated amino acid over the resin free amino
groups, using Fmoc-amino acid/PyBOP/HOBt/DIEA (1:1:1:2)
activation, with the exception of L-(+)-lactic acid, where
Fmoc-amino acid/DIPC/HOBt (1:1:1:1) activation was used.
Esterification of Abu to the free hydroxyl of lactic acid
was performed using the symmetrical anhydride (Fmoc-
Abu)2O in the presence of a catalytic amount (0.1 equiv.)
of DMAP, for 30 minutes at room temperature (12): the
reaction was repeated twice to achieve 92% yield. At the
end of the assembly, the peptide-resin was washed and the
peptide cleaved as described for substrate S1.

Purification to >98% homogeneity was achieved
through preparative HPLC on a Nucleosyl C-18 column
(250x21 mm, 7 µm) using as eluents (A) 50 mM ammonium
acetate, pH 6 and (B) acetonitrile. The gradient used for
both S3 and S4 was 20% B over 5 minutes, then 20-40% B over
20 minutes, flow rate 20 ml/min; the fractions containing
the pure material were pooled and lyophilised: yield 45%
and 35% for S3 and S4, respectively. The kinetic
parameters for this substrate, evaluated through the
HPLC-based assay (see figure 14A), were the following:

kcat (min⁻¹) = 3.51
Km (µM) = 10.95
kcat/Km (M⁻¹ s⁻¹) = 5342
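
The relation between the three parameters quoted here involves only a change of units, as the following sketch shows. The function name is hypothetical; the input values are the kcat and Km figures reported in this document for the fluorogenic substrate and for the amide substrate.

```python
def catalytic_efficiency(kcat_per_min, km_micromolar):
    """Convert kcat (min^-1) and Km (µM) into kcat/Km in M^-1 s^-1."""
    kcat_per_s = kcat_per_min / 60.0   # min^-1 -> s^-1
    km_molar = km_micromolar * 1e-6    # µM -> M
    return kcat_per_s / km_molar

fluorogenic = catalytic_efficiency(3.51, 10.95)  # depsipeptide values quoted above
amide = catalytic_efficiency(0.49, 79.0)         # amide substrate of the automatic assay
print(round(fluorogenic), round(amide))
```

Both results agree with the quoted kcat/Km values (5342 and 103 M⁻¹ s⁻¹) to within rounding.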

The buffer used for the assay is the following: 33
mM DTT, 50 mM Tris, pH 7, 50% glycerol, 2% CHAPS. The
incubation is carried out at pH 7.0 to minimise
spontaneous hydrolysis of the ester bond. The assay can
be run in a cuvette or in a (96-well) microtitre plate,
monitoring the fluorescence as a function of time
(excitation wavelength 355 nm, emission wavelength 495
nm). The increase in fluorescence upon substrate cleavage
is 13-fold. The reaction is linear, as shown in figure 14B
(fixed substrate concentration = 2 µM). The detection
limit was established as 1 nM for the high-throughput
microplate assay and 520 pM for the HPLC-based assay. If
a continuous (cuvette) assay is performed to establish
initial rates for the enzymatic reaction, the lower limit
for enzyme concentration is 80 nM, because of
fluorescence quenching of the cleaved substrate at
substrate concentrations higher than 10 µM.
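
The 13-fold fluorescence increase can be used to convert a reading into an amount of cleaved substrate, as sketched below. The sketch assumes that fluorescence scales linearly with the amount of product over the range used, which is an assumption that would not hold where quenching occurs; the function names and the example readings are hypothetical.

```python
FOLD_INCREASE = 13.0  # fluorescence gain on complete cleavage, as stated in the text

def fraction_cleaved_from_fluorescence(f_t, f_0, fold=FOLD_INCREASE):
    """Fraction of substrate cleaved, from a reading f_t and the intact-substrate reading f_0."""
    return (f_t / f_0 - 1.0) / (fold - 1.0)

def product_concentration_um(f_t, f_0, substrate_um=2.0):
    """Concentration of cleaved substrate (µM) for a given total substrate concentration."""
    return substrate_um * fraction_cleaved_from_fluorescence(f_t, f_0)

# Hypothetical readings in arbitrary units, with 2 µM substrate.
half_cleaved = product_concentration_um(700.0, 100.0)
```

Readings converted in this way at successive time points give a progress curve whose initial slope is the reaction rate.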

DEPOSITS 

Strains of E. coli DH1 transformed using the
plasmids pBac (1039-1226), pT7-7 (1039-1226), pT7-7
(1039-1206), pT7-7 (1027-1206) and pT7-7 (1033-1206),
coding, respectively, for the polypeptides with amino
acid sequences SEQ ID NO:1, SEQ ID NO:2, SEQ ID NO:3, SEQ
ID NO:4 and SEQ ID NO:5, were deposited on 14 August
1995 with The National Collections of Industrial and
Marine Bacteria Ltd. (NCIMB), Aberdeen, Scotland, U.K.,
with accession numbers NCIMB 40761, NCIMB 40762, NCIMB
40763, NCIMB 40764 and NCIMB 40765.
ABBREVIATIONS AND SYMBOLS USED IN THE TEXT

Abu = 2-aminobutyric acid; CHAPS = 3-[(3-cholamido-
propyl)-dimethyl-ammonio]-1-propanesulphonate; DABCYL =
4-[[4'-(dimethylamino)phenyl]azo]benzoic acid;
Depsipeptide = a peptide where at least one peptide
bond is replaced by the corresponding ester bond (the
location(s) of the ester bond(s) within the molecule is
usually indicated as ψ[COO] between the amino acid
residues involved); DIEA = N,N-diisopropylethylamine;
DIPC = N,N'-diisopropylcarbodiimide; DMAP = 4-
dimethylaminopyridine; DMF = N,N-dimethylformamide; DTT =
dithiothreitol; EDANS = 5-[(2'-
aminoethyl)amino]naphthalenesulfonic acid; EDTA =
ethylenediaminetetraacetic acid; HOBt = N-
hydroxybenzotriazole; HPLC = high-performance liquid
chromatography; PyBOP = benzotriazol-1-yl-oxy-tris-
pyrrolidino-phosphonium hexafluorophosphate; RET =
resonance energy transfer; t-Bu = tertiary-butyl; TFA =
trifluoroacetic acid; Trt (Trityl) = triphenylmethyl.
SEQUENCE LISTING
GENERAL INFORMATION
    (i) APPLICANT: ISTITUTO DI RICERCHE DI BIOLOGIA
        MOLECOLARE P. ANGELETTI S.p.A.
    (ii) TITLE OF INVENTION: METHODOLOGY TO PRODUCE,
        PURIFY AND ASSAY POLYPEPTIDES WITH THE
        PROTEOLYTIC ACTIVITY OF THE HCV NS3 PROTEASE
    (iii) NUMBER OF SEQUENCES: 47
    (iv) CORRESPONDENCE ADDRESS:
        (A) ADDRESSEE: Societa Italiana Brevetti
        (B) STREET: Piazza di Pietra, 39
        (C) CITY: Rome
        (D) COUNTRY: Italy
        (E) POSTAL CODE: I-00186
    (v) COMPUTER READABLE FORM:
        (A) MEDIUM TYPE: Floppy disk 3.5" 1.44 MBYTES
        (B) COMPUTER: IBM PC compatible
        (C) OPERATING SYSTEM: PC-DOS/MS-DOS Rev. 6.22
        (D) SOFTWARE: Microsoft Word 6.0
    (viii) ATTORNEY INFORMATION
        (A) NAME: DI CERBO, Mario (Dr.)
        (C) REFERENCE: RM/X88568/PC-DC
    (ix) TELECOMMUNICATION INFORMATION
        (A) TELEPHONE: 06/6785941
        (B) TELEFAX: 06/6794692
        (C) TELEX: 612287 ROPAT
(1) INFORMATION FOR SEQ ID NO: 1:
    (i) SEQUENCE CHARACTERISTICS
        (A) LENGTH: 191 amino acids
        (B) TYPE: amino acid
        (C) STRANDEDNESS: single
        (D) TOPOLOGY: linear
    (ix) FEATURE:
    (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:

Met Gly Leu Leu Gly Cys Ile Ile Thr Ser Leu Thr Gly Arg Asp Lys
1               5                   10                  15
Asn Gln Val Glu Gly Glu Val Gln Val Val Ser Thr Ala Thr Gln Ser
            20                  25                  30
Phe Leu Ala Thr Cys Val Asn Gly Val Cys Trp Thr Val Tyr His Gly
        35                  40                  45
Ala Gly Ser Lys Thr Leu Ala Gly Pro Lys Gly Pro Ile Thr Gln Met
    50                  55                  60
Tyr Thr Asn Val Asp Gln Asp Leu Val Gly Trp Gln Ala Pro Pro Gly
65                  70                  75                  80
Ala Arg Ser Leu Thr Pro Cys Thr Cys Gly Ser Ser Asp Leu Tyr Leu
                85                  90                  95
Val Thr Arg His Ala Asp Val Ile Pro Val Arg Arg Arg Gly Asp Ser
            100                 105                 110
Arg Gly Ser Leu Leu Ser Pro Arg Pro Val Ser Tyr Leu Lys Gly Ser
        115                 120                 125
Ser Gly Gly Pro Leu Leu Cys Pro Ser Gly His Ala Val Gly Ile Phe
    130                 135                 140
Arg Ala Ala Val Cys Thr Arg Gly Val Ala Lys Ala Val Asp Phe Val
145                 150                 155                 160
Pro Val Glu Ser Met Glu Thr Thr Met Arg Ser Pro Val Phe Thr Asp
                165                 170                 175
Asn Ser Ser Pro Pro Ala Val Pro Gln Ser Phe Gln Val Ala Leu
            180                 185                 190

(2) INFORMATION FOR SEQ ID NO: 2:
    (i) SEQUENCE CHARACTERISTICS
        (A) LENGTH: 195 amino acids
        (B) TYPE: amino acid
        (C) STRANDEDNESS: single
        (D) TOPOLOGY: linear
    (ix) FEATURE:
    (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:

Met Ala Arg Ile Arg Ala Leu Leu Gly Cys Ile Ile Thr Ser Leu Thr
1               5                   10                  15
Gly Arg Asp Lys Asn Gln Val Glu Gly Glu Val Gln Val Val Ser Thr
            20                  25                  30
Ala Thr Gln Ser Phe Leu Ala Thr Cys Val Asn Gly Val Cys Trp Thr
        35                  40                  45
Val Tyr His Gly Ala Gly Ser Lys Thr Leu Ala Gly Pro Lys Gly Pro
    50                  55                  60
Ile Thr Gln Met Tyr Thr Asn Val Asp Gln Asp Leu Val Gly Trp Gln
65                  70                  75                  80
Ala Pro Pro Gly Ala Arg Ser Leu Thr Pro Cys Thr Cys Gly Ser Ser
                85                  90                  95
Asp Leu Tyr Leu Val Thr Arg His Ala Asp Val Ile Pro Val Arg Arg
            100                 105                 110
Arg Gly Asp Ser Arg Gly Ser Leu Leu Ser Pro Arg Pro Val Ser Tyr
        115                 120                 125
Leu Lys Gly Ser Ser Gly Gly Pro Leu Leu Cys Pro Ser Gly His Ala
    130                 135                 140
Val Gly Ile Phe Arg Ala Ala Val Cys Thr Arg Gly Val Ala Lys Ala
145                 150                 155                 160
Val Asp Phe Val Pro Val Glu Ser Met Glu Thr Thr Met Arg Ser Pro
                165                 170                 175
Val Phe Thr Asp Asn Ser Ser Pro Pro Ala Val Pro Gln Ser Phe Gln
            180                 185                 190
Val Ala Leu
        195

(3) INFORMATION FOR SEQ ID NO: 3:
    (i) SEQUENCE CHARACTERISTICS
        (A) LENGTH: 174 amino acids
        (B) TYPE: amino acid
        (C) STRANDEDNESS: single
        (D) TOPOLOGY: linear
    (ix) FEATURE:
    (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:

Met Ala Arg Ile Arg Ala Leu Leu Gly Cys Ile Ile Thr Ser Leu Thr
1               5                   10                  15
Gly Arg Asp Lys Asn Gln Val Glu Gly Glu Val Gln Val Val Ser Thr
            20                  25                  30
Ala Thr Gln Ser Phe Leu Ala Thr Cys Val Asn Gly Val Cys Trp Thr
        35                  40                  45
Val Tyr His Gly Ala Gly Ser Lys Thr Leu Ala Gly Pro Lys Gly Pro
    50                  55                  60
Ile Thr Gln Met Tyr Thr Asn Val Asp Gln Asp Leu Val Gly Trp Gln
65                  70                  75                  80
Ala Pro Pro Gly Ala Arg Ser Leu Thr Pro Cys Thr Cys Gly Ser Ser
                85                  90                  95
Asp Leu Tyr Leu Val Thr Arg His Ala Asp Val Ile Pro Val Arg Arg
            100                 105                 110
Arg Gly Asp Ser Arg Gly Ser Leu Leu Ser Pro Arg Pro Val Ser Tyr
        115                 120                 125
Leu Lys Gly Ser Ser Gly Gly Pro Leu Leu Cys Pro Ser Gly His Ala
    130                 135                 140
Val Gly Ile Phe Arg Ala Ala Val Cys Thr Arg Gly Val Ala Lys Ala
145                 150                 155                 160
Val Asp Phe Val Pro Val Glu Ser Met Glu Thr Thr Met Arg
                165                 170

(4) INFORMATION FOR SEQ ID NO: 4:
    (i) SEQUENCE CHARACTERISTICS
        (A) LENGTH: 181 amino acids
        (B) TYPE: amino acid
        (C) STRANDEDNESS: single
        (D) TOPOLOGY: linear
    (ix) FEATURE:
    (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:

Met Ala Pro Ile Thr Ala Tyr Ser Gln Gln Thr Arg Gly Leu Leu Gly
1               5                   10                  15
Cys Ile Ile Thr Ser Leu Thr Gly Arg Asp Lys Asn Gln Val Glu Gly
            20                  25                  30
Glu Val Gln Val Val Ser Thr Ala Thr Gln Ser Phe Leu Ala Thr Cys
        35                  40                  45
Val Asn Gly Val Cys Trp Thr Val Tyr His Gly Ala Gly Ser Lys Thr
    50                  55                  60
Leu Ala Gly Pro Lys Gly Pro Ile Thr Gln Met Tyr Thr Asn Val Asp
65                  70                  75                  80
Gln Asp Leu Val Gly Trp Gln Ala Pro Pro Gly Ala Arg Ser Leu Thr
                85                  90                  95
Pro Cys Thr Cys Gly Ser Ser Asp Leu Tyr Leu Val Thr Arg His Ala
            100                 105                 110
Asp Val Ile Pro Val Arg Arg Arg Gly Asp Ser Arg Gly Ser Leu Leu
        115                 120                 125
Ser Pro Arg Pro Val Ser Tyr Leu Lys Gly Ser Ser Gly Gly Pro Leu
    130                 135                 140
Leu Cys Pro Ser Gly His Ala Val Gly Ile Phe Arg Ala Ala Val Cys
145                 150                 155                 160
Thr Arg Gly Val Ala Lys Ala Val Asp Phe Val Pro Val Glu Ser Met
                165                 170                 175
Glu Thr Thr Met Arg
            180

(5) INFORMATION FOR SEQ ID NO: 5:
    (i) SEQUENCE CHARACTERISTICS
        (A) LENGTH: 174 amino acids
        (B) TYPE: amino acid
        (C) STRANDEDNESS: single
        (D) TOPOLOGY: linear
    (ix) FEATURE:
    (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:

Ser Gln Gln Thr Arg Gly Leu Leu Gly Cys Ile Ile Thr Ser Leu Thr
1               5                   10                  15
Gly Arg Asp Lys Asn Gln Val Glu Gly Glu Val Gln Val Val Ser Thr
            20                  25                  30
Ala Thr Gln Ser Phe Leu Ala Thr Cys Val Asn Gly Val Cys Trp Thr
        35                  40                  45
Val Tyr His Gly Ala Gly Ser Lys Thr Leu Ala Gly Pro Lys Gly Pro
    50                  55                  60
Ile Thr Gln Met Tyr Thr Asn Val Asp Gln Asp Leu Val Gly Trp Gln
65                  70                  75                  80
Ala Pro Pro Gly Ala Arg Ser Leu Thr Pro Cys Thr Cys Gly Ser Ser
                85                  90                  95
Asp Leu Tyr Leu Val Thr Arg His Ala Asp Val Ile Pro Val Arg Arg
            100                 105                 110
Arg Gly Asp Ser Arg Gly Ser Leu Leu Ser Pro Arg Pro Val Ser Tyr
        115                 120                 125
Leu Lys Gly Ser Ser Gly Gly Pro Leu Leu Cys Pro Ser Gly His Ala
    130                 135                 140
Val Gly Ile Phe Arg Ala Ala Val Cys Thr Arg Gly Val Ala Lys Ala
145                 150                 155                 160
Val Asp Phe Val Pro Val Glu Ser Met Glu Thr Thr Met Arg
                165                 170



wo 97/08304 PCT/IT96/0Dld3 


(6) INFORMATION FOR SEQ ID NO: 6:
    (i) SEQUENCE CHARACTERISTICS
        (A) LENGTH: 14 amino acids
        (B) TYPE: amino acid
        (C) STRANDEDNESS: single
        (D) TOPOLOGY: linear
    (ix) FEATURE:
    (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:

Gly Ser Val Val Ile Val Gly Arg Ile Ile Leu Ser Gly Arg
1               5                   10
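
The three-letter sequences of this listing can be converted to the one-letter notation used in the tables, and their lengths checked, with a short script. This is an illustrative sketch; the function name is hypothetical and the example uses the 14-residue Pep4A sequence of SEQ ID NO: 6.

```python
# Standard three-letter to one-letter amino acid codes.
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
}

def to_one_letter(three_letter_sequence):
    """Convert a space-separated three-letter sequence to one-letter code.

    Raises KeyError for any token that is not a standard residue name.
    """
    return "".join(THREE_TO_ONE[token] for token in three_letter_sequence.split())

pep4a = to_one_letter("Gly Ser Val Val Ile Val Gly Arg Ile Ile Leu Ser Gly Arg")
print(pep4a, len(pep4a))
```

Because unknown tokens raise an error, the same conversion also serves as a basic check that a transcribed sequence contains only valid residue names and has the stated length.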

(7) INFORMATION FOR SEQ ID NO: 7:
    (i) SEQUENCE CHARACTERISTICS
        (A) LENGTH: 20 amino acids
        (B) TYPE: amino acid
        (C) STRANDEDNESS: single
        (D) TOPOLOGY: linear
    (ix) FEATURE:
        (A) NAME: Peptide
        (B) POSITION: 1
        (D) FURTHER INFORMATION: Xaa is Fmoc-Tyr
    (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:

Xaa Gln Glu Phe Asp Glu Met Glu Glu Cys Ala Ser His Leu Pro Tyr
1               5                   10                  15
Ile Glu Gln Gly
            20

(8) INFORMATION FOR SEQ ID NO: 8:
    (i) SEQUENCE CHARACTERISTICS
        (A) LENGTH: 16 amino acids
        (B) TYPE: amino acid
        (C) STRANDEDNESS: single
        (D) TOPOLOGY: linear
    (ix) FEATURE:
        (A) NAME: Peptide
        (B) POSITION: 1
        (D) FURTHER INFORMATION: Xaa is Ac-Tyr
    (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:

Xaa Gln Glu Phe Asp Glu Met Glu Glu Cys Ala Ser His Leu Pro Tyr
1               5                   10                  15

(9) INFORMATION FOR SEQ ID NO: 9:
    (i) SEQUENCE CHARACTERISTICS
        (A) LENGTH: 15 amino acids
        (B) TYPE: amino acid
        (C) STRANDEDNESS: single
        (D) TOPOLOGY: linear
    (ix) FEATURE:
        (A) NAME: Peptide
        (B) POSITION: 1
        (D) FURTHER INFORMATION: Xaa is Ac-Tyr
    (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9:

Xaa Gln Glu Phe Asp Glu Met Glu Glu Cys Ala Ser His Leu Pro
1               5                   10                  15

(10) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ix) FEATURE: 

(A) NAME: Peptide 

(B) POSITION: 1 

(D) FURTHER INFORMATION: Xaa is Ac-Tyr 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Xaa Gln Glu Phe Asp Glu Met Glu Glu Cys Ala Ser His Leu

1 5 10

(11) INFORMATION FOR SEQ ID NO: 11: 
(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ix) FEATURE:

(A) NAME: Peptide

(B) POSITION: 1

(D) FURTHER INFORMATION: Xaa is Ac-Tyr
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11:
Xaa Gln Glu Phe Asp Glu Met Glu Glu Cys Ala Ser His
1 5 10

(12) INFORMATION FOR SEQ ID NO: 12: 
(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ix) FEATURE: 

(A) NAME: Peptide 

(B) POSITION: 1 

(D) FURTHER INFORMATION: Xaa is Ac-Tyr 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
Xaa Gln Glu Phe Asp Glu Met Glu Glu Cys Ala Ser
1 5 10

(13) INFORMATION FOR SEQ ID NO: 13:
(i) SEQUENCE CHARACTERISTICS

(A) LENGTH: 11 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ix) FEATURE: 

(A) NAME: Peptide 

(B) POSITION: 1 

(D) FURTHER INFORMATION: Xaa is Ac-Tyr 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
Xaa Gln Glu Phe Asp Glu Met Glu Glu Cys Ala
1 5 10 

(14) INFORMATION FOR SEQ ID NO: 14: 
(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ix) FEATURE: 

(A) NAME: Peptide

(B) POSITION: 1

(D) FURTHER INFORMATION: Xaa is Ac-Asp 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
Xaa Glu Met Glu Glu Cys Ala Ser His Leu Pro Tyr 
1 5 10 

(15) INFORMATION FOR SEQ ID NO: 15: 
(i) SEQUENCE CHARACTERISTICS

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ix) FEATURE: 

(A) NAME: Peptide 

(B) POSITION: 1 

(D) FURTHER INFORMATION: Xaa is Ac-Glu 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
Xaa Met Glu Glu Cys Ala Ser His Leu Pro 
1 5 10 

(16) INFORMATION FOR SEQ ID NO: 16: 
(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ix) FEATURE:

(A) NAME: Peptide 

(B) POSITION: 1 

(D) FURTHER INFORMATION: Xaa is Ac-Met 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
Xaa Glu Glu Cys Ala Ser His Leu 
1 5 

(17) INFORMATION FOR SEQ ID NO: 17: 
(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear
(ix) FEATURE:

(A) NAME: Peptide

(B) POSITION: 1 

(D) FURTHER INFORMATION: Xaa is Ac-Glu 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
Xaa Cys Ala Ser His Leu Pro Tyr Ile Glu Gln Gly
1 5 10 

(18) INFORMATION FOR SEQ ID NO: 18: 
(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 10 amino acids 
(B) TYPE: amino acid

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ix) FEATURE: 

(A) NAME: Peptide 

(B) POSITION: 1 

(D) FURTHER INFORMATION: Xaa is Ac-Asp 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
Xaa Glu Met Glu Glu Cys Ala Ser His Leu 
1 5 10 

(19) INFORMATION FOR SEQ ID NO: 19: 
(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME: Peptide 

(B) POSITION: 

(D) FURTHER INFORMATION: 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19:
Asp Glu Met Glu Glu Cys Ala Ser His Leu 
1 5 10 

(20) INFORMATION FOR SEQ ID NO: 20:
(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear 
(ix) FEATURE: 

(A) NAME: Peptide 

(B) POSITION: 1 

(D) FURTHER INFORMATION: Xaa is Fmoc-Asp 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
Xaa Glu Met Glu Glu Cys Ala Ser His Leu 
1 5 10 

(21) INFORMATION FOR SEQ ID NO: 21: 
(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME: Peptide 

(B) POSITION: 1 

(D) FURTHER INFORMATION: Xaa is Ac-Asp 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
Xaa Cys Ser Thr Pro Cys Ser Gly Ser Val 
1 5 10 

(22) INFORMATION FOR SEQ ID NO: 22: 
(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME: Peptide 

(B) POSITION: 1 

(D) FURTHER INFORMATION: Xaa is Ac-Glu
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
Xaa Asp Val Val Cys Cys Ser Met Ser Tyr 
1 5 10 

(23) INFORMATION FOR SEQ ID NO: 23:
(i) SEQUENCE CHARACTERISTICS

(A) LENGTH: 10 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ix) FEATURE: 

(A) NAME: Peptide 

(B) POSITION: 1 

(D) FURTHER INFORMATION: Xaa is Ac-Ala 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 
Xaa Glu Met Glu Glu Cys Ala Ser His Leu 
1 5 10 

(24) INFORMATION FOR SEQ ID NO: 24: 
(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ix) FEATURE: 

(A) NAME: Peptide 

(B) POSITION: 1 

(D) FURTHER INFORMATION: Xaa is Ac-Asp 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
Xaa Ala Met Glu Glu Cys Ala Ser His Leu 
1 5 10 

(25) INFORMATION FOR SEQ ID NO: 25: 
(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ix) FEATURE: 

(A) NAME: Peptide 

(B) POSITION: 1 

(D) FURTHER INFORMATION: Xaa is Ac-Asp
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 
Xaa Glu Ala Glu Glu Cys Ala Ser His Leu 






1 5 10 

(26) INFORMATION FOR SEQ ID NO: 26:

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ix) FEATURE:

(A) NAME: Peptide 

(B) POSITION: 1 

(D) FURTHER INFORMATION: Xaa is Ac-Asp
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 
Xaa Glu Met Ala Glu Cys Ala Ser His Leu 
1 5 10 

(27) INFORMATION FOR SEQ ID NO: 27: 
(i) SEQUENCE CHARACTERISTICS

(A) LENGTH: 10 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ix) FEATURE: 

(A) NAME: Peptide 

(B) POSITION: 1 

(D) FURTHER INFORMATION: Xaa is Ac-Asp
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 
Xaa Glu Met Glu Ala Cys Ala Ser His Leu 
1 5 10 

(28) INFORMATION FOR SEQ ID NO: 28: 
(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ix) FEATURE: 

(A) NAME: Peptide 

(B) POSITION: 1 

(D) FURTHER INFORMATION: Xaa is Ac-Asp
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28:
Xaa Glu Met Glu Glu Ala Ala Ser His Leu 
1 5 10 

(29) INFORMATION FOR SEQ ID NO: 29: 
(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ix) FEATURE: 

(A) NAME: Peptide 

(B) POSITION: 1 

(D) FURTHER INFORMATION: Xaa is Ac-Asp 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

Xaa Glu Met Glu Glu Cys Ala Ala His Leu 

1 5 10 

(30) INFORMATION FOR SEQ ID NO: 30:
(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ix) FEATURE: 

(A) NAME: Peptide 

(B) POSITION: 1 

(D) FURTHER INFORMATION: Xaa is Ac-Asp
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 

Xaa Glu Met Glu Glu Cys Ala Ser Ala Leu 

1 5 10

(31) INFORMATION FOR SEQ ID NO: 31:
(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 
(ix) FEATURE: 

(A) NAME: Peptide

(B) POSITION: 1

(D) FURTHER INFORMATION: Xaa is Ac-Asp
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31:
Xaa Glu Met Glu Glu Cys Ala Ser His Ala
1 5 10

(32) INFORMATION FOR SEQ ID NO: 32: 
(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 
(ix) FEATURE: 

(A) NAME: Peptide 

(B) POSITION: 1 

(D) FURTHER INFORMATION: Xaa is Ac-Glu

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 
Xaa Glu Met Glu Glu Cys Ala Ser His Leu 
1 5 10

(33) INFORMATION FOR SEQ ID NO: 33: 
(i) SEQUENCE CHARACTERISTICS

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ix) FEATURE:

(A) NAME: Peptide 

(B) POSITION: 1 

(D) FURTHER INFORMATION: Xaa is Ac-Asn 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 
Xaa Glu Met Glu Glu Cys Ala Ser His Leu
1 5 10 

(34) INFORMATION FOR SEQ ID NO: 34: 
(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 10 amino acids 
(B) TYPE: amino acid

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear
(ix) FEATURE:

(A) NAME: Peptide 

(B) POSITION: 1 

(D) FURTHER INFORMATION: Xaa is Ac-Lys 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 
Xaa Glu Met Glu Glu Cys Ala Ser His Leu 
1 5 10

(35) INFORMATION FOR SEQ ID NO: 35: 
(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 10 amino acids 
(B) TYPE: amino acid

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ix) FEATURE: 

(A) NAME: Peptide 

(B) POSITION: 1 

(D) FURTHER INFORMATION: Xaa is Ac-Asp
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 
Xaa Glu Met Glu Glu Cys Ser Ser His Leu 
1 5 10

(36) INFORMATION FOR SEQ ID NO: 36: 
(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ix) FEATURE: 

(A) NAME: Peptide 

(B) POSITION: 1 

(D) FURTHER INFORMATION: Xaa is Ac-Asp
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 
Xaa Glu Met Glu Glu Cys Phe Ser His Leu 
1 5 10

(37) INFORMATION FOR SEQ ID NO: 37: 
(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 
(ix) FEATURE: 

(A) NAME: Peptide 

(B) POSITION: 1 

(D) FURTHER INFORMATION: Xaa is Ac-Asp
(ix) FEATURE: 

(A) NAME: Peptide 

(B) POSITION: 6 

(D) FURTHER INFORMATION: Xaa is Al< (allylglycine)
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 
Xaa Glu Met Glu Glu Xaa Ala Ser His Leu 
1 5 10 

(38) INFORMATION FOR SEQ ID NO: 38: 
(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ix) FEATURE: 

(A) NAME: Peptide 

(B) POSITION: 1 

(D) FURTHER INFORMATION: Xaa is Ac-Asp
(ix) FEATURE: 

(A) NAME: Peptide 

(B) POSITION: 6 

(D) FURTHER INFORMATION: Xaa is Abu (α-aminobutyric acid)
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 
Xaa Glu Met Glu Glu Xaa Ala Ser His Leu 
1 5 10 

(39) INFORMATION FOR SEQ ID NO: 39: 
(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 
(ix) FEATURE: 

(A) NAME: Peptide 

(B) POSITION: 1 

(D) FURTHER INFORMATION: Xaa is Ac-Asp 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39:
Xaa Glu Met Glu Glu Thr Ala Ser His Leu
1 5 10

(40) INFORMATION FOR SEQ ID NO: 40: 
(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 10 amino acids 
(B) TYPE: amino acid

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear 
(ix) FEATURE: 

(A) NAME: Peptide 

(B) POSITION: 1 

(D) FURTHER INFORMATION: Xaa is Ac-Asp 
(ix) FEATURE: 

(A) NAME: Peptide

(B) POSITION: 6 

(D) FURTHER INFORMATION: Xaa is Nva (norvaline) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 
Xaa Glu Met Glu Glu Xaa Ala Ser His Leu 

(41) INFORMATION FOR SEQ ID NO: 41: 
(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ix) FEATURE: 

(A) NAME: Peptide

(B) POSITION: 1

(D) FURTHER INFORMATION: Xaa is Ac-Asp
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41:
Xaa Glu Met Glu Glu Val Ala Ser His Leu
1 5 10 

(42) INFORMATION FOR SEQ ID NO: 42: 
(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 13 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ix) FEATURE: 

(A) NAME: Peptide

(B) POSITION: 1 

(D) FURTHER INFORMATION: Xaa is Ac-Asp 
(ix) FEATURE: 

(A) NAME: Peptide 
(B) POSITION: 6

(D) FURTHER INFORMATION: Xaa is Abu (2-aminobutyric
acid) ester bonded to the adjacent following residue
(ix) FEATURE:

(A) NAME: Peptide

(B) POSITION: 7

(D) FURTHER INFORMATION: Xaa is Ala ester bonded
to the adjacent preceding residue
(ix) FEATURE:

(A) NAME: Peptide

(B) POSITION: 13

(D) FURTHER INFORMATION: Xaa is Lys(Nε-Ac)-NH2
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42:
Xaa Glu Met Glu Glu Xaa Xaa Ser His Leu Pro Tyr Xaa
1 5 10

(43) INFORMATION FOR SEQ ID NO: 43: 
(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 
(ix) FEATURE:

(A) NAME: Peptide

(B) POSITION: 1

(D) FURTHER INFORMATION: Xaa is Ac-Asp
(ix) FEATURE:

(A) NAME: Peptide

(B) POSITION: 6

(D) FURTHER INFORMATION: Xaa is Thr ester bonded
to the adjacent following residue
(ix) FEATURE:

(A) NAME: Peptide

(B) POSITION: 7

(D) FURTHER INFORMATION: Xaa is Ala ester bonded
to the adjacent preceding residue
(ix) FEATURE:

(A) NAME: Peptide

(B) POSITION: 13

(D) FURTHER INFORMATION: Xaa is Lys(Nε-Ac)-NH2
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43:
Xaa Glu Met Glu Glu Xaa Xaa Ser His Leu Pro Tyr Xaa
1 5 10

(44) INFORMATION FOR SEQ ID NO: 44: 
(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 13 amino acids 
(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 
(ix) FEATURE: 

(A) NAME: Peptide 

(B) POSITION: 1 

(D) FURTHER INFORMATION: Xaa is Ac-Asp

(ix) FEATURE: 

(A) NAME: Peptide 

(B) POSITION: 6 

(D) FURTHER INFORMATION: Xaa is Abu (2-aminobutyric
acid) ester bonded to the adjacent following residue
(ix) FEATURE:

(A) NAME: Peptide

(B) POSITION: 7

(D) FURTHER INFORMATION: Xaa is Ala ester bonded
to the adjacent preceding residue
(ix) FEATURE:

(A) NAME: Peptide

(B) POSITION: 13

(D) FURTHER INFORMATION: Xaa is Lys(Nε-[3H]-CH3CO)-NH2

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44:
Xaa Glu Met Glu Glu Xaa Xaa Ser His Leu Pro Tyr Xaa
1 5 10

(45) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ix) FEATURE: 

(A) NAME: Peptide 

(B) POSITION: 1 

(D) FURTHER INFORMATION: Xaa is Ac-Asp
(ix) FEATURE: 

(A) NAME: Peptide 

(B) POSITION: 3 

(D) FURTHER INFORMATION: Xaa is Asp (EDANS) 
(ix) FEATURE: 

(A) NAME: Peptide 

(B) POSITION: 6 

(D) FURTHER INFORMATION: Xaa is Abu (2-aminobutyric
acid) ester bonded to the adjacent following residue
(ix) FEATURE: 

(A) NAME: Peptide 

(B) POSITION: 7 

(D) FURTHER INFORMATION: Xaa is Ala ester bonded
to the adjacent preceding residue
(ix) FEATURE:

(A) NAME: Peptide 

(B) POSITION: 9 

(D) FURTHER INFORMATION: Xaa is Lys (DABCYL) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 
Xaa Glu Xaa Glu Glu Xaa Xaa Ser Xaa 
1 5 

(46) INFORMATION FOR SEQ ID NO: 46: 
(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear
(ix) FEATURE:

(A) NAME: Peptide

(B) POSITION: 1 

(D) FURTHER INFORMATION: Xaa is Ac-Asp 
(ix) FEATURE: 

(A) NAME: Peptide 

(B) POSITION: 2 

(D) FURTHER INFORMATION: Xaa is Asp (EDANS) 
(ix) FEATURE: 

(A) NAME: Peptide 

(B) POSITION: 6 

(D) FURTHER INFORMATION: Xaa is Abu (2-aminobutyric
acid) ester bonded to the adjacent following residue
(ix) FEATURE: 

(A) NAME: Peptide 

(B) POSITION: 7 

(D) FURTHER INFORMATION: Xaa is Ala ester bonded 
to the adjacent preceding residue 
(ix) FEATURE: 

(A) NAME: Peptide 

(B) POSITION: 9 

(D) FURTHER INFORMATION: Xaa is Lys (DABCYL) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 






Xaa Xaa Met Glu Glu Xaa Xaa Ser Xaa 
1 5 

(47) INFORMATION FOR SEQ ID NO: 47: 
(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 13 amino acids 
(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear
(ix) FEATURE:

(A) NAME: Peptide 

(B) POSITION: 1 

(D) FURTHER INFORMATION: Xaa is Ac-Asp
(ix) FEATURE: 

(A) NAME: Peptide 

(B) POSITION: 13 

(D) FURTHER INFORMATION: Xaa is Lys-ε-(3H)Ac
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47:
Xaa Glu Met Glu Glu Cys Ala Ser His Leu Pro Tyr Xaa
1 5 10 
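
The entries of the sequence listing lend themselves to a mechanical consistency check. The following is a minimal sketch, not part of the original disclosure: the helper names `to_one_letter` and `length_matches` are hypothetical, and the example string is SEQ ID NO: 47 with its final residue read as Xaa. It converts the three-letter notation used above to one-letter codes and compares the residue count with the declared LENGTH field.

```python
# Illustrative sketch only; the function names below are assumptions,
# not part of the patent text.
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
    "Xaa": "X",  # modified residue, defined under FEATURE in each entry
}


def to_one_letter(three_letter_sequence: str) -> str:
    """Convert a space-separated three-letter sequence to one-letter codes."""
    residues = three_letter_sequence.split()
    unknown = [r for r in residues if r not in THREE_TO_ONE]
    if unknown:
        # Flags misread codes such as "Gin" (for Gln) or "lie" (for Ile).
        raise ValueError(f"unrecognised residue codes: {unknown}")
    return "".join(THREE_TO_ONE[r] for r in residues)


def length_matches(three_letter_sequence: str, declared_length: int) -> bool:
    """Check the residue count against the declared LENGTH field."""
    return len(three_letter_sequence.split()) == declared_length


seq_47 = "Xaa Glu Met Glu Glu Cys Ala Ser His Leu Pro Tyr Xaa"
print(to_one_letter(seq_47))       # XEMEECASHLPYX
print(length_matches(seq_47, 13))  # True
```

A check of this kind only confirms that the codes are recognised and that the count agrees with the LENGTH field; it says nothing about the chemical modifications recorded under FEATURE.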
CLAIMS

1. Isolated polypeptides, characterised in that they
consist of an amino acid sequence chosen from the group
comprising SEQ ID NO: 1, SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID
NO: 4 and SEQ ID NO: 5, and in that they have the
proteolytic activity of the HCV NS3 protein.

2. Expression vectors, for the production of one of
the polypeptides according to claim 1 in a host organism,
comprising:

- a polynucleotide coding for one of said
polypeptides;

- functional regulation, transcription and
translation sequences within said host organism,
operatively bonded to said polynucleotide; and

- optionally, a selection marker.

3. Host cell, either eukaryotic or prokaryotic,
transformed using an expression vector according to claim
2, capable of expressing the specific polypeptide coded
in the chosen polynucleotide sequence.
4. A process for preparing one of the polypeptides
according to claim 1, characterised by the fact that it
comprises, in combination, the following operations:

- transformation of a host cell, either eukaryotic
or prokaryotic, using an expression vector containing a
DNA sequence coding for a polypeptide chosen from the
group of sequences indicated in SEQ ID NO: 1, SEQ ID NO: 2,
SEQ ID NO: 3, SEQ ID NO: 4 and SEQ ID NO: 5;

- expression of the desired DNA sequence to produce
the chosen polypeptide; and

- purification of the polypeptide thus obtained,
avoiding resolubilisation protocols.

5. Peptides, characterised in that they consist of
an amino acid sequence chosen from the group of sequences
indicated in SEQ ID NOS: 7-12, 14, 18-20, 29-32, 35 and
47, and by the fact that they can be used as substrates
in a high-throughput assay of the in vitro activity of
polypeptides having HCV NS3 proteolytic activity.

6. Depsipeptides, characterised in that they consist
of an amino acid sequence chosen from the group of
sequences indicated in SEQ ID NOS: 42-46, and by the fact
that they can be used as substrates in a high-throughput
assay of the in vitro activity of polypeptides having HCV
NS3 proteolytic activity.

7. A method for reproducing and effectively assaying
in vitro the proteolytic activity of the HCV NS3 protein,
characterised by the fact that the activity of the
polypeptides according to claim 1 is reproduced and
tested in a solution containing 30-70 mM Tris pH 6.5-8.5,
3-30 mM dithiothreitol, 0.5-3% 3-[(3-cholamidopropyl)-
dimethylammonio]-1-propanesulphonate and 30-70% glycerol
at temperatures of between 20 and 25 °C, in a
high-throughput assay, using as substrates the peptides of
claim 5 or the depsipeptides of claim 6.
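
By way of illustration only, the ranges recited in claim 7 can be checked programmatically before an assay plate is prepared. The sketch below is not part of the claimed method: the dictionary keys, the function name `out_of_range` and the example values are assumptions chosen for readability, and `chaps_percent` refers to the propanesulphonate detergent named in claim 7 (commonly abbreviated CHAPS).

```python
# Illustrative sketch only. The numeric ranges are transcribed from
# claim 7; all names and the example values are hypothetical.
CLAIM_7_RANGES = {
    "tris_mM": (30.0, 70.0),
    "pH": (6.5, 8.5),
    "dtt_mM": (3.0, 30.0),            # dithiothreitol
    "chaps_percent": (0.5, 3.0),      # detergent named in claim 7
    "glycerol_percent": (30.0, 70.0),
    "temperature_C": (20.0, 25.0),
}


def out_of_range(conditions: dict) -> list:
    """Return the parameters that are missing or outside the claim 7 ranges."""
    problems = []
    for name, (low, high) in CLAIM_7_RANGES.items():
        value = conditions.get(name)
        if value is None or not (low <= value <= high):
            problems.append(name)
    return problems


# Hypothetical buffer chosen inside every range, for demonstration.
example = {
    "tris_mM": 50,
    "pH": 7.5,
    "dtt_mM": 10,
    "chaps_percent": 2,
    "glycerol_percent": 50,
    "temperature_C": 23,
}
print(out_of_range(example))                              # []
print(out_of_range({**example, "glycerol_percent": 10}))  # ['glycerol_percent']
```

The enzyme concentrations of claims 8 and 9 could be added to the same table in the same way.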

8. The method for reproducing and effectively
assaying in vitro the proteolytic activity of HCV NS3
according to claim 7, in which the peptides of claim 5
are used in a high-throughput assay at concentrations of
the polypeptides according to claim 1 of between 100 and
200 nM.

9. The method for reproducing and effectively
assaying in vitro the proteolytic activity of HCV NS3
according to claim 7, in which the depsipeptides of
claim 6 are used in a high-throughput assay at
concentrations of the polypeptides according to claim 1
of between 0.5 and 2 nM.

10. The method for reproducing and effectively
assaying in vitro the proteolytic activity of HCV NS3
according to claim 9, in which continuous monitoring of
the proteolytic activity of the polypeptides of claim 1
is carried out by use of depsipeptides chosen from the
group of sequences represented by SEQ ID NO: 45 and SEQ
ID NO: 46 as substrates, with internal fluorogenic
quenching by "Resonance Energy Transfer" between a
fluorescent donor, 5-[(2'-aminoethyl)amino]naphthalene-
sulfonic acid (EDANS), close to one end of the
depsipeptide, and an acceptor group, 4-[[4'-
(dimethylamino)phenyl]azo]benzoic acid (DABCYL), close to
the other end of the depsipeptide.
Plasmid map labels: lacZ (β-gal); BamHI; NS3 (1039-1226); pBacNS3 (1039-1226); SacI; SphI

P ETL = promoter of the gene encoding the PCNA protein
P PH = polyhedrin promoter
Amp = gene encoding β-lactamase (ampicillin resistance)
LacZ (β-gal) = gene encoding β-galactosidase
Col E1 = pBR322 origin of replication

Fig. 1




Plasmid map labels: pT7-7NS3(1039-1226); pT7-7NS3(1039-1206)

Φ10 = Φ10 promoter of bacteriophage T7
rbs = Shine-Dalgarno ribosome binding sequence
ATG = translation initiation site of the protein encoded by gene 10 of bacteriophage T7
β-lactamase = gene encoding β-lactamase (ampicillin resistance)
Col E1 = pBR322 origin of replication

Fig. 2a
Plasmid map labels: pT7-7NS3(1027-1206); pT7-7NS3(1033-1206); NdeI

Φ10 = Φ10 promoter of bacteriophage T7
rbs = Shine-Dalgarno ribosome binding sequence
ATG = translation initiation site of the protein encoded by gene 10 of bacteriophage T7
β-lactamase = gene encoding β-lactamase (ampicillin resistance)
Col E1 = pBR322 origin of replication

Fig. 2b
Glycerol dependence of NS3 activity

Fig. 3




Fig. 4 
Ionic strength dependence of NS3 activity